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Protection of Plants 



OBJECT OF THE INVENTION 

The present invention describes a new species of binucleated Rhizoctonia suited for use in plant 
protection. 



SUMMARY OF THE INVENTION 

The present invention describes a new species of binucleated Rhizoctonia able to protect different 
kind of plants, like tomatoes, onions, grass, pine trees, and others, from infections produced by 
different pathogenic fungi among them Fusarium, VerticiUiim, Phythophtora and pathogenic 
Rhizoctonia, as well as yeast and pathogenic bacteria and the appropriated methodology for the 
molecular identification of this new species, its detection based on probes specially designed and 
its practical application for the protection of crops. 

Thus the present invention uses the ITS data to distinguish isolates and in particular to identify 
pathogenic and candidate protective isolates belonging to the binucleated group. 

The invention also provides specific PGR primers for the ITS molecular analysis of pathogenic 
and protective strains of Rhizoctonia. 



PREFERRED EMBODIMENTS 

The protective species can be utilised in a form of powder obtained after fermentation of the 
micro-organisms with subsequent drying of the filtered broth , applied to the seed before seeding 
or applied to the soil before seeding or both , and will preclude the growth of the pathogen 
Rhizoctonia protecting the plant from the infection. 



In the present invention, the ITS sequence is employed as a marker to distinguish strains and 
correspondingly classify the pathogenicity of isolates of Rhizoctonia. The ITS information 
enables phylogenetic location of protective isolates, and design of specific primers for the 
identification of new isolates. 



SUT^IMARY OF FIGURES 

Figure 1 is a schematic representation of the Internal Transcribed Spacer (ITS) region of 
ribosomal DNA of Rhizoctonia. 

Figure 2 is a phylogenetic tree for the results obtained in the Examples. 

Figure 3 shows photographic evidence of the protective effect of RBN isolates B and F2 on 
tomato seedlings. 

3. a.- Shows the plates corresponding to the tomato seedling as negative control and the plate 
with tomato seedling with the pathogenic strain Me 8-2. 

3.b.- Shows the negative control with the protective isolates without pathogen. 
3.C.- Protective effect of F2 on Me 8-2 action, both simultaneous and subsequently. 
3.d.- Protective effect of B on Me 8-2 action , both simultaneous and subsequently. 

EXAMPLES OF THE INVENTION 

In this study we sequenced the ITS - 5.8S regions of 13 binucleate isolates of Rhizoctonia isolated 
from Albacete (South-East of Spain), 9 binucleate isolates from CBS (Central Bureau 
Schimmelcultures, Baarn, Delft, Netherlands) collection, 3 multinucleate isolates from Albacete 
and multinucleate isolates belonging to AG 2. The later multinucleate isolates were included to 
determine the sequence variation in rDNA regions and to assess this variation as a possible tool for 
identifying groups into Rhizoctonia spp. More generally, the isolates from collections were 
included for comparison purposes and completeness of the phylogenetic tree. 




Isolation of Rhizoctonia spp. from the field. 

Samples of saffron, alfalfa and pine seedlings were collected at different locations in Albacete. 
South East of Spain. Isolation of Rhizoctonia spp. from these samples was performed by washintj 
saffron roots, alfalfa stems and roots and whole pine seedlings with tap water to remove adherins 
soils particles and plating plant segments on 2% water agar amended with chloramphenicol (250 
mg/ml). The cultures were characterised as Rhizoctonia by observing hyphal and colony 
morphology and vegetative features microscopically, such as hyphae with dolipore septa, basally 
constricted branching near the distal septum of cells, and absence of clamp connections and 
conidia. Cultures were maintained and stored at room temperature on potato-dextrose-agar [PDA, 
Oxoid LTD, U.K.]. The number of nuclei per hyphal cells was determined using a staining 
method described by Julian eta/. (Julian et ai, 1997). Briefly, 3 mm" agar plugs of inoculum 
were placed in petri dishes containing liquid complete medium [CM: 5 g malt extract (Merck, 
Darmstadt, Germany), 5 g yeast extract (Difco, Detroit, USA) and 5 g glucose anhydrous (Merck, 
Darmstadt, Germany) per litter] and incubated for 3 d at 230 C in the dark. The resulting 
mycelial colony was transferred to a microscope slide, excess medium was removed and a drop of 
50 mg/ml Hoechst 33342 (Molecular Probes Europe, Leiden, The Netherlands) was added. Mitv 
a 15 min incubation in the dark, the staining solution was removed and 50% glycerol was added. 
A coverslip was placed on sample and the number of nuclei was determined by fluorescence 
microscopy using an Olympus BHS microscope supplemented with a BH2-RFC reflected light 
fluorescence attachment (with dichroic mirrors BH2-DMU for UV excitation). 

Other Rhizoctonia isolates 

Binucleate Rhizoctonia spp. isolates have been obtained from the CBS collection and are listed in 
Table 2. 



Table 1. Original isolates described and sequenced for the first time in the invention 



Isolates 


Nuclear condition 


Identification 


Isolated from 


Geographical origin 












FI 


Binucleate 


Unkno\\n 


Saffron 


Albacete (Spain) 














F2 


Binucleate 


Unknown 


Saffron 


Albacete (Spain) 












F3 


Binucleate 


Unknown 


Saffron 


Albacete (Spain) 












F5 


Binucleate 


Unknown 


Saffron 


Albacete (Spain) 












F6 


Binucleate 


Unknown 


Saffron 


Albacete (Spain) 












F7 


Binucleate 


Unknown 


Saffron 


Albacete (Spain) 












SI 


Binucleate 


Unknown 


Saffron 


Albacete (Spain) 












S2 


Binucleate 


Unknown 


Saffron 


Albacete (Spain) 












S3 


Binucleate 


Unknown 


Saffron 


Albacete (Spain) 












S4 


Binucleate 


Unknown 


Saffron 


Albacete (Spain) 












S5 


Binucleate 


Unknown 


Saffron 


Albacete (Spain) 












S6 


Binucleate 


Unknown 


Pinus pinaster 


Albacete (Spain) 












S7 


Multinucleate 


Unknown 


Pinus pinaster 


Albacete (Spain) 












Al 


Multinucleate 


Unknown 


Alfalfa 


Albacete (Spain) 












A2 


Multinucleate 


Unknown 


Alfalfa 


Albacete (Spain) 












B 


Binucleate 


Unknown 


Alfalfa 


Albacete (Spain) 



Table 2 Reference isolates belonging to the Binucleated Group obtained from different collections 



Isolates 


Nuclear Condition 


Identification 


Isolated in 


Geographical Origin 












CI 


Binucleate 


Ceratobas. Comigerum 


Festuca sp. 


Pennsylvania USA 












C2 


Binucleate 


Cerarobas. Comigerum 


lead of Pittosporum 


Pennsylvania USA 













C4 


Binucleate 


Ceratobas. Comigemm 


Junipems sp. 


North Carolina USA 












C5 


Binucleate 


Ceratobas. Comigerum 


Taxus sp. 


Rhode Island USA 












C6 


Binucleate 


Ceratobas. Comigerum 


Erigeron canadiensis 


Pennsylvania USA 












C8 


Binucleate 


Ceratobas. Comugerum 


Pittospomm sp. 


Florida USA 












Cll 


Binucleate 


Ceratobas, Cerealis 


Secale cereale 


Wageningen, NL 












C12 


Binucleate 


Ceratobas. Cerealis 


Secale cereale 


Gemiany 












C13 


Binucleate 


Ceratobas. Cerealis 


Triticum aestivum 


Germany 












17-329 


Binucleate AG- A 


Ceratobas. Corgigerum 


Strawberry 


Saga. Japan 












17-326 


Binucleate AG-A (CAG-2) 


Ceratobas. Comigerum 




USDA 












16-321 


Binucleate AG-BO 


Ceratobas. Comigemm 


Patata 


Tokushima, Japan 












17-328 


Binucleate AG-B6 






USDA 












16-320 


Binucleate AG-C 


Ceratobas. Comigemm 


Gimnadenia conopsea 


Hokkaido, Japan 












16-319 


Binucleate AG-D 


Ceratobas, Gramineum 


Wheat 


Hokkaido. Japan 












4-7 b 


Binucleate AG-E 


Ceratobasidium sp. 


Flax 


Hokkaido. Japan 










— 


17-331 I 


Binucleate AG-G ( 


"eratobasidium sp. 


Peanut 


Chiba, Japan 












16-315 1 


Binucleate AG-I I 


Jnknown Artemisia 


rokaydo. Japan 














16-314 1 


Binucleate AG-J ( 


"eratobasidium sp. i 


5oil ] 


<anio. Japan 












16-317 E 


Binucleate AG-K C 


reraiobasisium sp. ( 


3nion I 


■iokkaido. Japan 














.9-363 


Binucleate AG-L 


Ceratobasidium sp. 


Soil 


Fukuoka. Japan 












19-368 


Binucleate AG-N 


Unknown 


Soil 


Tochigi. Japan 












19-367 


Binucleate AG-P 


Ceratobas, Comigerum 


Tea 


Shizuoka T;inAn 












19-365 


Binucleate AG-R 


Ceratobasidium sp. 


Cucumis 


Georgia. USA 












19-366 


Binucleate AG-S(CAG-7) 


Ceratobas. sp. 


Pinosporum 


Florida. USA 













Table 3 Isolates of multinucleated Rhizoctonia solani AG-2. 



Isolates 


Nuclear condition 


Identification 


Isolated in 


Geographical origin 












Jl 


Multinucleate L-9-IS 


Thanatephorus cucumeris 


Zoysia grass 


Gifu Jj^nan 












J2 


Multinucleate G4 


Thanatephoms cucumeris 


Zoysia grass 


Gifu Janan 












J3 


Multinucleate AJ-1-10-1 


Thanatephorus cucumeris 


Zoysia grass 














J4 


Multinucleate RGR38 


Thanatephorus cucumeris 


Zoysia grass 


Shizuoka, Japan 












J5 


Multinucleate RGR 39 


Thanatephorus cucumeris 


Zoysia grass 


Shizuoka. Japan 












J6 


Multinucleate SLP3-3 


Thanatephorus cucumeris 


St. Augustine grass 


Okinawa. Japan 












J7 


Multinucleate RGR-59 


Thanatephorus cucumeris 


Bent grass 


Shizuoka. Japan 












J8 


Multinucleate C-116 


Thanatephorus cucumeris 


Mat rush 


Kumamoto, Japan 












J9 


Multinucleate C-306 


Thanatephorus cucumeris 


Mat rush 














J 10 


Multinucleate 9207 


Thanatephorus cucumeris 


Beefsteak plant 


Hyogo. Japan 












Jll 


Multinucleate No, 2 


Thanatephorus cucumeris 


Maize 


Miyazaki. Japan 












J12 


Multinucleate Gu-1 


Thanatephorus cucumeris 


Guzmania 


Gifu. Japan 













J13 


Multinucleate K-1 


Thanatephonis cucumeris 


Sugar beet 


Hokkaideo. Japan 












J14 


Multinucleate S-2 


Thanatephonis cucumeris 


Sugar beet 


Hokkaido. Japan 












515 


Multinucleate R56 


Tlianatephonis cucumeris 


Sugar beet 


Hokkaido. Japan 












J16 


Multinucleate R64 


Thanatephonis cucumeris 


Sugar beet 


Hokkaido. Japan 












J17 


Multinucleate 92155-1-2 


Thanatephonis cucumeris 


Burdock 


Hokkaido. Japan 












J18 


Multinucleate 92155-1-3 


Thanatephonis cucumeris 


Burdock 


Hokkaido. Japan 



Pathogenicity assays 

Pathogenicity tests were performed with the strains isolated from the field in Albacete on the 
following plant species: radish {Raphaims sativus), cauliflower {Brassica oleracea var. botry tis), 
onion {Allium cepd), turf grasses: zoysia grass {Zoysia tenuifoUd), St. Augustine grass 
{Stenotaphriim secuudatum) and Bermuda grass (Cynodon dactylon), tomato {Lycopersicum 
escidenturn), carrot {Daucus carota) and cucumber {Cucumis sativum). Seeds were surface- 
disinfected in 1% sodium hypochlorite for 10 min, rinsed with sterile distilled water and 
aseptically blotted between two layers of wet sterile paper into petri dishes. They were allowed to 
germinate for 4 days at 260 C in the dark. Isolates were grown on water agar for 2 days at 260 
C. Five germinated seeds were then placed around the fungal colony. The plates were 
incubated at room temperature in the lab bench, and pathogenicity was assayed for the first time 7- 
10 days after the tests were set, and subsequent observations were made afterwards every uvo 
days. The Rhizoctonia solani AG 4 strain Me8-2 (Boysen et al, 1996) was used as positive 
control, since this strain is known to be very pathogenic to all plant species tested. A negative 
control of germinated seeds without fungus was included in each of the tests. 



Protection assays 



The strains that did not show pathogenicity towards one or more plant species were tested for 
protection from the pathogenic strain Me 8.2. This was done by inoculating both the pathogenic ^ 
strain and the supposedly protective strain. This was done in two ways, simultaneously (both 
strains inoculated at the same time) and successively (first the supposedly protective strain was 
allowed to colonise the plants surface for three days and then the pathogenic strain was 
inoculated). A negative control (no fungus), a positive control (the pathogenic strain Me 8.2), 
and the supposedly protective strain alone were included for each of the plant species in the tests. 
Protection was assayed for the first time 7 days after the tests were set, and subsequent 
observations were made every two days. 

Isolation of genomic DNA, asymmetric PCR and DNA sequencing procedures 

For DNA extraction, Rhizoctonia plugs were inoculated in 50 ml complete medium [CM: 5 g malt 
extract (Merck, Darmstadt, Germany), 5 g yeast extract (Difco, Detroit, USA) and 5 g glucose 
anhydrous (Merck, Darmstadt, Germany) per litre] in a flask, the mycelium was grown for 4 days 
at 23° C and 150 rpm agitation, blotted dry and freeze dried. After grinding, mycelium was 
suspended in 500 ml DNA-extraction buffer [20 mM Tris/HCl pH 8.0, 50 mM EDTA and 0.4 % 
(w/v) SDS], incubated at room temperature for 5 min and extracted twice with an equal volume of 
phenol: chloroform: isoamylalcohol (25:24:1). Subsequently, the aqueous phase was incubated 
for 30 min at 37° C in the presence of 50 mg/ml RNase A and then extracted with chloroform: 
isoamylalcohol (24: 1). DNA was precipitated by adding 0.6 volumes isopropanol, washed with 
70% ethanol, dried and resuspended in 100 ml ultrapure sterile water. 

Eucariotical ribosomal genes are arranged in a tandem repeat with the 5.8S gene flanked by 
internal transcribed spacers (ITS). Fig 1 is a schematic representation of the Internal Transcribed 
Spacer (ITS) region of the ribosomal DNA. The shadowed boxes represent the ribosomal genes. 
The arrows denote the position of the PCR and sequencing primers. 

The regions of rDNA repeat from the 3' end of the 18S gene to the 5' end of the 28S gene were 
amplified using the primers ITS 5, 4 , 4B and IF (White et al. Gardes et al). Asymmetric PCR to 
synthesise ssDNA of the ITS region was done with a 50: 1 molar ratio between two primers 
(Gyllenstein). Fragments of approximately 700 bp were obtained when using Rhizoctonia 




jenomic DNA as a template for PGR amplification of the ITS region. Different combinations of 
I primers were tested for the asymmetric amplification. Asymmetric PGR products using 
ITS4/ITS5 in the ratio 50:1 were found to give better results when sequencing with ITSl and 
ITS3, whereas the combination of ITS1F/ITS4B (also in the 50:1 ratio) was found to give better 
results when sequencing with ITS2 and ITS4. Manual sequencing was performed using the 
Sanger's dideoxy method (Sanger et ai^ 1977). 

Phylogenetic analysis 

The sequence data were checked between complementary strands using GCG Fragment Assembly 
System (Program Manual for the Wisconsin Package, version 8, Wisconsin). The resulting 
sequences were aligned using the software package GLUSTALW (Thompson et at,, 1994) 
(IntelliGenetics, Inc., Mountain View, CA). All the data were processed for phylogenetic 
analysis of the nucleotide and gap differences. The distances in the ITS-5.8S rDNA region were 
determined by Kimura two parameter model (Kimura, 1980), omitting all sites with gaps. 
Phylogenetic trees were constructed using the maximum-parsimony analysis with the heuristic 
search of the Phylogeny Using Parsominy Analysis (PAUP) program 3.1.1. (Swofford, 1991) 
which gaps treated as missing data. The branch and bound algorithm provides an efficient 
approach to find all the minimal lengths. The trees were rooted with the rDNA sequence of 
Trichoderma harzianum. A bootstrap analysis using 1000 resample of the sequence data was 
carried out, and using the heuristic search option of PAUP. Trees were also obtained by using the 
Neighbour-joining method (Saitou and M,, 1987) as it is in the program "Neighbour" from the 
PHYLIP package version 3.5 (Felsenstein, 1993): The topological accuracy was again estimated 
by the bootstrap method with 1000 replicates. The percentage of bootstrap replicates which 
yields each grouping as a measure of statistical confidence. A grouping found on 95% of 
bootstrap replicates was considered statistically significant. 

RESULTS 



Pathogenicity assays 




rmcifers (radish and cauliflower) are susceptible to all Rhizoctonia isolates tested, regardless of 
whether they were multinucleated or binucleated. The same was true for onion. Grass, tomato \ 
and carrot were susceptible to all multinucleated /?/7/roc/c?/?/ae (Me 8.2, aAl, aA2), but were not 
susceptible to any of the binucleate strains, therefore protection tests were done with these plant 
species. The strains randomly chosen to test protection from the pathogenic strain Me 8.2 were 
the binucleate strains aB and F2. Cucumber was susceptible to all strains tested, except for S7. 
therefore this strain was subsequently tested for protection from the pathogenic strain Me 8.2. 
The strain Me 8.2 showed significatively stronger pathogenicity than any other strains tested, and 
this was seen in shorter times needed to attack the plant, and complete killing of the seedlina, 
whereas with other pathogenic strains the seedlings were damaged, but not as much affected as 
with isolate Me 8.2. In the pathogenicity tests was also observed that in some cases the seeds had 
not been completely surface sterilised, and in the negative control (no fungus) was observed 
growth of some fungi other than Rhizoctonia, yeast or bacteria, but when Rhizoctonia had 
colonised the seedlings, there was no growth of these contaminants. 



Protection essays 



For grass, tomato and carrot, it was observed that the negative control (no fungus) presented 
contamination from other fungus, yeast and/or bacteria, whereas the control with protective strains 
B and F2 were clean, that is, Rhizoctonia did not allow any of the contaminants carried in the seed 
to grow. As expected, plants inoculated with protective strains B and F2 looked completely 
healthy, whereas plants inoculated with Me 8.2 were heavily injured. In the simultaneous test 
(that is, protective and pathogenic strains inoculated at the same time) there was no protection of 
the binucleate strains from the pathogenic Me 8,2. But in the successive test, in which the 
binucleate strains B and F2 were allowed to colonise the plant surface prior to adding the 
pathogenic strain, a very clear protective effect was observed, and the plants looked as healthy as 
the negative control and the control with the binucleate strains alone (Fig. 3). Furthermore, it was 
also obser\*ed that the seedlings covered by the protective strains looked slightly bigger than the 
negative control, thus pointing to the possibility that there is an increased growth response induced 
by the non-pathogenic Rhizoctonia isolates (Fig. 3). This has already been described by (Sneh ^/ 
a/., 1986). In cucumber, although some protection was seen, we could not observe a clear 



protective effect of strain S7, therefore we could not conclude that strain S7 acts as a protective 
I strain in cucumber. 

We foresee the same protective behaviour foe the entire group of the new species including F2. 
protects much better than Al and A2 which are pathogenic. 

Sequencing PCR products from the PCR amplified ITS regions 

PCR experiments carried out on the ITS0.8S rDNA region of 20 isolates of binucleaie 
Rhizoctonia spp (including isolates of Ceratobasidium from CBS collection and RBN testers) A 
single band was detected in each amplification reaction when analysed by agarose gel 
electrophoresis. The amplified DNA was sequenced directly in both strands. 

Primers obtained from the sequenced isolates of binucleated Rhizoctonia for their identification 

Albacete 
Ab-1 

CCCTATTAAGGGGCA 

Binucleated 
Bin 

CCGTAAAAAAGTCTT 

Cerealis 
Cerll 

CGCGAGAGAGAGGCTGGC 

Albacete 
Ab+ 

CCACACAAACCCATTGTATT 



Cornigerum 



-org 

AATTGTTCCTTTGGGCAC 

Albacete 
Ab-2 

GGAACCTTTATTGGAC 

Cerealis 
Cerl2 

AATGTAATCGATGTAAACGC 

Albacete 
Ab-3 

CTCAATTTATTTTAAAACGA 

The molecular definition of the members of the new species group may be based on several 
specific sequences of the ITS segment. The sequence CC ACAC AAACCCATTGTATT of the 
primer AB+ is common and specific to the group of F2. with the exception of S6 that changes on 
nucleotide (ATCG instead of ATTG), see pages 3-4 of the manual alignment. 

This primer may define the group whereas we cannot define the isolate B. This isolate has a 
unique property of having cells with two and with four or more nuclei. This characteristic, neve 
described before may be important for the understanding of the relationship between binucleated 
and multinucleated Rhizoctowa. 

Phylogenetic analysis 

The CLUSTALVV sequence alignments are shown in Exhibit A. 



Reference is also made to the phylogenetic tree of Figure 2. The 5.8S rDNA gene sequence (155 
bp) was conserved among the Rtuzoctoma isolates examined. Among the isolates analysed, the 
sequence of ITS 1 was more divergent than that of the ITS 2 region. There was 1 00-8 1 % 
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.equence homology among the sequences of total ITS regions of all the isolates used in this study. 
Sequence homology among binucleate isolates varied from 99-85%, and 85-75% between 
binucleate and multinucleated isolates (AG 2). In phylogenetic analysis a consensus sequence of 
the ITS regions of 18 AG 2 isolates was used. Phylogenetic analysis separates binucleate and 
multinucleated in two different clusters. In the multinucleated cluster it is possible to 
differentiate AG 2 and AG 4, and also subgroups within each AG that correlate with pathogenicity 
and geographical origin. Binucleate isolates clustered together in an independent branch. It is 
interesting that Spanish isolates from Albacete cluster differentiated from binucleate isolates from 
the CBS collection. Multinucleated isolates from alfalfa (Al and A2) cluster in AG 4 group, with 
similar features of pathogenicity. In the tree, horizontal lengths in the branches indicate the 
relative genetic distances between the isolates. 

Fl, F6, F5, F7, S5, S6, F2, F3 and SI thus comprise the new species so far. S3 clusters with 
reference strains and should be similar to AGN, AGP, AGBO, and AGK. S2 and SS are similar 
to AGL, AGI, AGC and AGD. 
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C JSTALW ALIGNMENT 



c:i6 

92117 
MO 

GUI 
AJllMl 
G4 
RGR39 
L91S 
RGR38 
SLP33 
921S512 
Ki 

9215513 
Sdos 
R64 
R56 
R65 
S5 
5(5 
Fl 
F2 
F3 
F5 
F7 
F6 
SI 

Al 
B 

AGK 
AGN 
AGP 
AGB(^ 
AGBB 
S3 
AGJ 

ack; 

C5 
C8 
C2 
C4 
C6 

c: 

Cll 
Ci2 
CI3 
S2 
S4 
AGL 
AGI 
AGC 
AGA 
CAG2 
AGD 
AGE 
AGR 
AGS 

THARZIA^fU 



GGATAAACTXXTGTAACAAGGT 
GATAAAAGTCCTAACAAGGT 
ATAAAAAGTCGTAACA-^GCTT 
AGTAAAAGrrCtTTAACAAGGT 
AGTAAAACrrCGTAACAAGCTT 
AOrAAAAGTCGTAACAAGGT 
ACTTAAAAGTCCTAACAAGGrr 
GTAAAAAGrrCGTAACAAGCTT 

AGTCGTAACAAGGT 

AAAGTCG- jA.^ AAGGT 

AAAGTCG- A.-jC.^-=dGGT 

AAGTCG - AAC.AAGGT 

AGTAAAAGTCGTAACAAGGT 
AGTAAAAGTCGTAACAAGGT 
AGTAAAAGTCGTAACAAGGT 
AGTAAAAGTCGTAACAAGGT 
AGTAAA^.GTCGTAAGAAGGT 

AAAGTCGTAACAAGGT 

ACTAAAAGTCGTAACAAGCTr 
AGTAAAAGTCGTAACAAGGT 

AACAAGGT 

CTAACAAGGT 

GATGT AACAAGGT 

TCJT AACAAGGT 

ATGT AACAAGGT 

GATGT AACAAGGTT 

AGTAAAAGTCGTAACAAGGT 
AGTAAAAGTCGT-i-i^CAAGGT 
AGTAAAAGTCCrAACAAGGT 

GTCGTAACAAGGT 

GTCGTAACAAGGT 

-TCTTGGTCATTTAa^ySGA AGTAAAAGTCGT.-.s^.^-AGGT 
-TCTTGGTCATTTAC»AGGA AGTAAAAGTCGTAACAAGGT 

- -CTrGGTCATTTAGAGGA AGTAAAAGTCGTAACAAGGT 

- -CTTGGrCATrrA(l=vGGA AGTAAAAGTCGTAACAAGGT 

- -CTTGGTCATTTAGAGGA AGTAAAAGTCGTAAC/'uAGGT 
-TCTTGGTCATTTAaAGGA AGTAAAAGTCGTAACAAGGT 

- -CITGGTCATTTAaAGGA ACT ATT AGTCGTAACAAGGT 



TTAGi^GGGGA 

AGAGGA 

TTAGAGGA 

GGA 

ITCCCrTTTTTAGAGGA 

TTTAGAGCGAA 



CTTGC ,TC ATTTAG^aGGA 

CTTGGTCATTTAGAGGA 

CTTGtTTCATTTAGAGGA 

GCTCTTGGTCATrrAGAGGA 
GCTCTTGCTCATTTAGAGaA 

- - -CTTCKTTCATTTAGAGa^ 

- - TCTTGGTC ATTTAG.iA3GA 



- -TAGAAGTC&T AACAAGGT 



TTCCGTAGGTGAACCTGCGG 

ttccgtaggtgaacctgcgg 
ttccgtaggtg>j>jcctgcgg 
ttccgtaggtgaacctgcgg 

rrCCGTAGGTGA.-CCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTaiiACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGKiAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGi^nCCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGiJ^CCTGCGG 
TTCCGTAGGTa^uACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGCACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
rrCCGTAGGTaV-CCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCCGTAGGTGAACCTGCGG 
TTCaTr/vGGTGAACCTGCGG 
TTCCGTAGGrGA.-.CCTGCGG 
TTCCGTAGGTAAACCTGCGG 
CGCGG 

cr^GG 

CTCCGTTGGTGAACCAGCGG 



------- - TT.-\AAC AAAGG ~ ■ 

TTAAACAAA-GG- - 

TT.W.CAAAGG- - 

TTAAAC AAAGG- - 

TTAAACA^i'-^GG- - 

-TTAAAC AA.\GG- - 

TTAAACAAAGG- - 

TTAJ^ACAAAGG- ■ 

TTAAAC A.^=^GG- ■ 

TTAAACAAAGG- 

TTAAACAA-=iGG- 

TTAP^AC AAAGG- 

TTAAAC A^\CG - 

TTAAAC AAAGG - 

TTAAAC AA.-.GG - 

TAAACAAACG - 

TTAAAC AAAC-G - 

TTAAAC AAAGG - 

TTAAAC AAAGG - 

AAGGATCATTATTGAATG- - 
AAGGATCATTATTGAATG- - 
A-\GGATCATTATTaAATG- - 
AAGGATCATTATTGAATG- - 
AAGGATCA.TT ATTGAATG - - 
.AAGGATCATTATTGAATG- - 
AAGGATCATTATTGAATG- - 
iAGaATCATTATTGAATG- - 
AAGGATXIATT ATTGAATG - - 
A^i.GGATCATTATTaAATT- - 
iAGGATCATTATTGAATT- - 
AAGG.ATCATTATTGAATT- - 
.-AGaATCATTATTGAACCAC 
Ar.GGATC-.TTATTGAATCA- 
AAGGATCATTATTGAATG- - 
AAGGATCATTATTa--ATG- - 
AAGGATCATTATTGAATG - - 
AAGGATCATTATTGAATG - - 
AAGGATCATTATTGAATGA- 
AAGGATCATTATTGAATG- - 
rAGGATCATTAATGAATGT- 
ATGaATCATTAATGAA TTT- 
AAGaATCATTATTGAATTTT 
AAGGATCATTATTGAATTTT 
AAGGATCATTATTGAA TTT- 
AAGGATCATTATTGAATTTT 
AAGCATCATTAATaAATG- - 
.-A.GCATCATTAATGAATG- - 
AAGGATCATTAATtlAATG- - 
AAGaATCATTATCGA-ATG- - 
A.AGGATCATTATCa-ATG- - 
AAGGATC-.TTATTGAATG- 
AAGGATCATTATTGAATGA- 
AAGGATCATTATTGAATGA- 
AAGaATCATTATrGAATCA- 
AAGGATCATTATTG AATGA - 
AAGGATCATTATTGAATCA- 
AAGGATCATTATTa-ATaA- 
A- -GGTC-TTAATaAATGA- 
A- -GGTC-TTAATGAATCA- 
AGGaATCATTACCGAGTT- - 



--_AT- 
AAT- 
AAT- 
,--AT- 
A-AT- 
A_AT- 
A.-.T- 
Ai.T- 
A--.T- 
AAT- 
AAT- 
A-AT- 



AAC= 
A-AC- 
A-AC- 
A_AC- 
AA.— " 
AAT- 
AAT- 
AAT- 
AAT- 
AAT- 
AAT- 
A.AT- 
AAT- 
A-AT- 
A.AT- 
AAC' 
A.AC 
A-AC 



GT ATTITGGTTG 

GT ATTTAGGTTG 

CTT ATTTTGGTTG 

GT ATTTAGGTTG 

GT ATTTAGGTTG 

GT ATTTTGGTTG 

GT ATTTr*'XrrTG 

GT ATTT.-jGGTTG 

GT ATTT.-jGGTTG 

GT ATTT ACGTTG 

GT ATTT.-GGTTG 

GT ATTT.-GGTTG 

GT ATTTAGGTTG 

GT ATTT.-GGTTG 

GT ATTTAGGTTG 

GT ATTT.-GGTTG 

GTATTT.-X}GTTC J 

GTGTTTAG GTT G 

GTGTIT.-GGTTG 

AT AGAG - - TTGGTTG 
AT AGAG - - TTGGTTG 
AT AGAG - - TTGGTTG 

■ AT AGAG - - nXIGTTG 

■ AT AGAG - - TTGGTTG 

• AT AGAG - - TTGGTTG 

- AT AGAG - - TTG GTTG 

■ AT AGAG - - TTGGTTG 

■ AT AGAG - - TTGGTTG 

- GTAGAGT- TTGGTTG 

- GTAGAGT- TTGGTTG 
-GTAGAGT- TTGGTTG 
-TTGGAGTC- -GGTTG 
-CTAGAGTT- -GGTTG 
^CTTGGAGTC - -GGTTG 

- TTAGAGTC - -GGTTG 
-TTAGAGTC- - GGTTG 

- TT AGAG - - TCGGTTG 
-CTAGAGTT- -GGTTG 

- GT AGAGTT - - GGTTG 

- GTGGAGT-T -GGTTG 
-GTGGAGT- T-GGTTG 
-GAGGAGT- TTGGTTG 

- GAGGAGT- T -GGTTG 

- GTAGAGT- T-GGTTG 

- GAGGAGT - TTGGTTG 

- GTAGAGT - C - GGTTG 

■ -GTAGAGT-C-GGTTG 

• - GTAGAGT- C -GGTTG 

• - AATGGGGATCGGTTG 
' - AATGGGGATCGGTTG 

• - ATGGAGT- TTGCTTTG 

- - CTGGAGT- TTC-GTTG 

- -CTAGAGTT- -GGTTG 

- -CTAGAGTT- -GGTTG 

- -CTAGAGTT- -GGTTG 

- - CTAGAGTT - - GGTTG 

- - CTAGAGTT - -CXTTTG 

■---GGGAGTT G 

GGGAGTT G 

TACAA C 



^^116 


TAGCTGGCTC-- 


-CATTA-- 




TAGCTGGCTC-- 


-CATTA-- 




TAGCTGGCTC-- 


-CATTA-- 


NO 


TAGCTGCXTC-- 


-CATTA-- 


C3t)fS 


TAGCTCO^CTC-- 


-CATTA-- 


GUI 


TAGCTGGCTC-- 


-CATTA- - 


11(11 


TAGCTGGCrC-- 


-CGTTA-- 


G4 


TAGCTGGCTC-- 


-CATTA-- 


RGR39 


TAGCTC3GCTC-- 


-CATTA-- 


L91S 


TAGCTGGCTC-- 


-CATTA-- 


RGR39 


TAGCTGGCTC-- 


-CATTA-- 


SLP33 


TAGCTGGCTC-- 


-CATTA-- 


9215512 


TAGCTGGCTC-- 


-CATTA-- 


Kl 


TAGCTGGCTC-- 


-CATTA-- 


9215513 


TAGCTGTCTC-- 


--CATTA-- 


Sdos 


TAQCTGGCTC-- 


- -CATTA- - 


R64 


TAGCTGGCTC-- 


■-CATTA-- 


R56 


TAGCTGGCTC-- 


■ -CATTA- - 


R65 


TAGCTGGCTC-- 


- -CATTA- - 


S5 


TCGCTGGCT--- 


--OCT 


S6 


TCGCTGGCr--- 


--CCT 


Fi 


TCGCTGGCT--- 


--CCT 


F2 


TCGCTGGCT--- 


--CCT 


F3 


TCGCTGGCT-- 


--OCT 


F5 


TCGCTGGCT-- 


--CCT 


F7 


TCGCTGGCr-- 


--CCT 


F6 


TCGCTGGCT-- 


--CCT 


SI 


TCGCTGGCT-- 


--CCT 


A2 


TAGCTGGCr- - 


- -CCTAATT 


Al 


TAGCTGGCT- - 


--CCTAATT 


B 


TAGCTGGCT- - 


--CCTAATT 


AGK 


TCGCTGGCTG- 


--TCT T 


AGN 


TCGCTGGCTG- 


--TCTT 


AGP 


TCGCTGGCTG- 


--TCTT 


AGBO 


TCGCTGGCTG- 


--TCTT 


AGBB 


TCGCTGGCTG- 


--TCTT 


S3 


TCGCTGGCT-- 


GTC 


AGJ 


TCGCTGGCCC- 


---CT 


AGG 


TCGCTGGCCC- 


TT 


05 


TAGCTGGTC-- 


CTAATT 


C8 


TAGCTOCiTC-- 


CTAATT 


C2 


TAGCTGGCC- - 


CTGATT 


C4 


TTGCTGUCC- - 


TTA.ATT 


cr> 


TAGCTGGC 


IT 


Cl 


TAGCTGGCC-- 


---CTGATT 


Cll 


TAGCTGCJGT- - 


CT T 


C12 


TAGCTGGCT- - 


CT T 


C13 


TAGCTGGGT- - 


CT T 


S2 


TCGCTGGCTT- 


CTT 


S4 


TCGCTGGCTT- 


CTT 


AGL 


TTGCTGGCCC- 


CT 


AGI 


TTGCTGGCCC- 


CT 


AGC 


TCGCTGGCCC- 


CT 


AGA 


TCGCTGGCCC- 


CT 


CAG2 


TCGCTGGCCC- 


CT 


AGD 


TCGCTGGCCC- 


---CT 


AGE 


TCOCTGGCCC- 


CT 


AGR 


ATGCTGGCCA- 




AGS 


ATGCTGGCCA- 




THARZIATIU 


TCCCAAAC 


--CC 



GTTTGGA-GCATG- 

GTTTCaA-GCATG- 

GTITGGA-GCATGA 

GTTTGGA-GC^TG- 

GnTGGA-GCATG- 

ATTTCGA-GCATG- 

GTTTGGA-GCATG- 

CTTIXXiA-GCATG- 

GTTTOGA-GCATG- 

GTITGGA- GCATG - 

GTITGaA-GCATG- 

GTTTGGA-GCATG- 

GTTTGaA-GCATG- 

GTTTGGA-GCATG- 

GTITGGA-GCATG- 

GTITGGA-GCATG- 

GTTTGGA-GCATG- 

GrrrGGA-GCATG- 

GTITGGA-GCATG- 

C-CGGGA-GCAT-- 

C-CGGGA-GCAT-- 

C-CGCaA-GCAT-- 

C-CGGGA-GCAT-- 

C-CGGGA-GCAT-- 

C-CGGGA-GCAT-- 

C-CGGaA-GCAT-- 

C -CGGGA -GCAT— 

CGCGOGA-GCAT-- 

AAACTTGGGG-GCAT-- 

- - - AAACTTGGGG -GCAT — 
---AAACTTGCXSG-GCAT— 

ATTTGGCA-GCAT- - 

TGGCA-GCAT-- 

TGGCA-GCAT- - 

TGGCA-GCAT- - 

TGGCA-GCAT- - 

TTTCGCA-GCAT-- 

T CACTGGG- -GCAT- - 

T c AC KGGG - GT AT — 

TCCTTTGG--ACAT-- 

TCCTTTGGG-TC AT- - 

G -TTTCCTTTGG - -CAATTT 
G-TTCCA GG CACT- 

- - - TTCrcnX5GGG - GCG - - - 
G -TTTCCTTTGG --- CAATT 
TTAATCaA- - GG - - CC AT — 
TTAATCGA- -GG - -CCAT- - 
TTAATCGA- -GG - -CCAT- - 

CATTGAG -GCAT — 

CATTGAG-GCAT- - 

T TATrGGG--GCAT-- 

T TATTGGG- -GCAT — 

T CACTGGG- -GCAT— 

T CACTGGG--GCAT-- 

T CACTGGG- -GCAT-- 

T CACTGGG - -GCAT- - 

T CACTGGGGCCAT- - 

- - -GGrrCTCTGGC- - -AT- - 

GGTCTCTGGC AT- - 

AATGTCAACCAT-- 



TGC ACACdTTTGCTCT 

TGC AC ACCTITTGCTCT 

TGCAC ACCTTTTGCTCT 

- - - TGCACACCTTTTGCTCT 

TGCACACCTTTTGCTCT 

TGCACACCTTTTGCTCT 

TGCACACCmTGCTCT 

- - - TGCACACCTTTTGCTCT 

- - - TGCACACCTTTTGCTCT 
TGCACACCTTTTGCTCT 

- - - TGCACACCTTTTGCTCT 

TGCACACCTTTTGCTCT 

- - -TGCACACCTTTTGCTCT 
TGC AC ACCTITTGCTCT 

- - - TGCACACCTTTTGCTCT 

TGGAC ACCTTTTGCTCT 

TGCACACCTTTTGCTCT 

- - - TGCAC ACCTITTGCTCT 

TGCACACCTTTTGCTCT 

--GTGCACGC-TIT- -CTCT 
— GTGCACGC-TTT- -CTCT 
- -GTGCACGC - TTT- -CTCT 
- -GTGCACGC-TTT- -CTCT 

- - GTGCACGC-TTT — CTCT 

- GTGCACGC-TTT- -CTCT 
--GTGCACGC-TTT- -CTCT 
--GTGCACAC-TTT--CTCT 
--C?rGCACGC-TTr--CTCT 
--GTGCACACCTTT--CTCT 
--GTGCACACCnT--CTCT 

- GTGCACACCTTT- -CTCT 
--GTGCACGCCT-T--CTCT 
--GTGCACGCCT-T- -CTCT 

- -GTGCACGCCT-T- -CTCT 
— GTGCACGCCT-T- -CTCT 
—GTGCACGCCT-T- -CTCT 
--GTGCACC«:-CTT--CTCT 
- -GTGCACGCCT-T- -CTCT 
—GTGCACGCCT-T- -CTCT 

- GTGCACACCT-T- -CTCT 

- - GTGCAC ACCT -T- -CTCT 
-TGTGCACGCCTCT- -CTCT 
-TGTGCACACCT-T- -CTCT 
-GGTGCACACCT-T- -CTCT 
TTGTGCACGCCTCT — CTCT 
--GTGCACGCCT-T- -CTCT 

- - CTGCACACCT -T- -CTCT 
—GTGCACGCCT-T- -CTCT 

- -GTGCACGCCnr — CTCT 

- - GTGCACGCCTTT- -CTCT 
--GTGCACACCT-C--CTCT 
— GTGCACACCT-C- -CTCT 
— GTGCACGCCT-C- -CTCT 

- - GTGCACGCCT-C - -CTCT 
--GTGCACCCCT-C--CTCT 
-- GTGCACGCCT-C - -CTCT 
--GTGCACGCCT-C- -CTCT 

- -GTCCICGCrrCCGACAAT 
— GTGCTCGCTTCCGACAAT 

- - - ACC AAACTGTTGCCTCG 



'ITi'lTi'- AATCC- - ACAC^C - 

'ITITIT - AATCC - - AC AC=*.C - 

' 1 ■ I'l' 1 ■ IT - nJ<TCC - - AC AC-.C - 

TITTTT - AATCC --.-CAOvC - 

Tmrr - AATCC --.-jCacac - 

TTTTTT-AATCC--ACAC^C - 
TTTTTT-AATCC- -ACAC^C - 
TlTTiT-AATCC- -ACAC\C - 
TITTTT- AATCC- -ACAC^C - 
TITTTT - AATCC --ACAC^vC - 
TTTTTT- AATCC- - ACAC^C - 
TTTTTT-AATCC- -ACACAC • 
TTTTTT- AATCC --ACACkC - 
TTTTTT-AATCC- -ACAC--C • 
Trmr- AATCC- -ACACAC ■ 
TmTT- AATCC - - ACACAC ■ 
TTTTrTTAATCC--ACAC=.C ■ 
Tl ' i ' i ' l ' i ' i 'AATCC- - ACACAC • 
TTTTTTT.\ATCC----i:AC=tC ■ 
- -TTC- — ATCC- -ACACAC 

--TIC ATCC--ACAC.iLC 

— TTC- - - ATCC- - .-jCAC=iC 

- -TTC ATCC - - ACACAC 

- - TTC- — ATCC- - ACACaC 

- -TTC ATCC- - ACACAC 

- - TTC ATCC - - AC A.CAC 

- -TTC ATCC- - ACACAC 

- -TTC - - - ATCC- - ACACAC 

- -TTC ATCCC - ATACAC 

- -TTC ATCCC - ATACAC 

- -TTC - - - ATCCC - ATACAC 

T-TTC ATCCACACAC^C 

T-TIC ATCCACACAC- - 

A-TTC - - - ATCC- - ACACAC 
A-TTC - - - ATCC- - ACACAC 
A-TTC - - - ATCC- - ACAC\C 
A- TTC - - - ATCC- - AC AC=.C 
- -TTC - - - ATCC- - ACACAC 

- -TTC ATCC- - ACACAC 

- -TIC ATCC- - .-CAC^^C 

- - TTC ATCC- - .-CACnC 

- -TTC - - - ATCC- - AjCACAC 

- -TTC - - - ATCC- - ACACAC 

- -TTC ATCC- - ACAC=vC 

—TTC ATCC- - ACACAC 

- -TTC ATCC- - ACACAC 

- - TIC ATCC - - AC-.CAC 

- -TTC ATCC- - .-CAC=vC 

ATTIC ATCC- - ACACAC 

ATTTC ATCC- - ACACAC 

A-TTC ATCC - - ACACAC 

A-TTC ATCC - - ACACAC 

A- TTC ATCC - - AC^C=iC 

A- TTC - - - ATCC- - ACACAC 
A-TTC - - - ATCC - - iCACAC 

A- TTC ATCC - - ACACAC 

A-TTC - - - ATCC- - .-CACAC 

C - TT ATCC - - AT AC - - 

C-TT ATCC- - ATAC- - 

- - - GCGGGGTCAJCGCCCCGG 



- ACCTGTa--ACCT:?rG - 

- iCCTC?rCL--ACCTGTG - 
-ACCTGrCi-AC 
-.ACCTGrCi-ACC 

- ACCTGTG.-_-.CCTGTG 

- .-CCTGTGA-ACCT3TG - A 

- ACCTGTa-.-.CCTCTTG - ~ 

- .-C:C"iG'i'G.--ACCTC7rG - .- 

- ACCTGTC-.ACCTGTG - .- 

■ - ACCTGTC-_ACCTC7rG - 

- CCCTGTGAA.CCTGTG - = 

■ - iCCTGTa--ACCTGTG- = 

■ - ,"CCTGTG.--ACCTGTG - 

■ - ACCTGTCCA.CTTCTrG - = 

■ - ACGTGTGCACTTCrrG - .- 

■ - ACCTGTGC ACTTJIX: - 
TTGCACT 



-ACC 

- .-jCCTGTGC ACTTGTG - AG 

- ACCTGTGCACTTGTg - .'J3 

- ACCTGTGCACTTGTG - A-G 
-ACCTCT 
-ijCCTGTGCAC 

- ACCTGTGCA.CCTGTC 

- iCCTGTGCA; 

- iCCTGTGCACCTGTG - 

- CCCTGTGCACTrGTG - 
-CCCrGTGCACTTGTG- 

- CCCTGTGC.-.CTTGrG - 

- CCCTGTGCACCTGTG - 

- CCCTGTGCACCTGTG - 

- ccctgtgca-Ct: 

-ACCTGTGC 





- .-cctgtgca ctt: 

- ACCTGrGAACTTCrrvj - 

- ACCTGTa-_-.CT--.:7rG - 
-ACCTGTGCACr?^- 

- ACCTGTGC ACTTGTG - 
-ACCTGTG.--ACC 

- AC CTGTGCACTTCTG 

- ACCTGTGCACCrrjrr 

- ACCTGTGCACCTGTT 
-ACCTGTGC-.CC 



TG- 




- iCCTGTaAACTTCTG - 
--AC CTGT a-.AC 

- - iCCTGTC--ACTTGTC 
--ACCTGTCA 
--ACCrGTCA-AC 

- - .-CCTGTaA.-.CTCCCG - 

- - ACCTGTCAi.CI* ~CCG - 

- - ACCTGTa-_Acrrc 

- - ACCTGTGCAC 

--ACCTGTGCAC 

GTGCGTCGCAGCCCCC; 



TAG 
T.-J3 



■> CXAGAtX; CATOaHTGGCi 

RGi- J 9 GCA-iJ^-aA CATGGATGGG 

92(17 GC-jCI^C iG C\TGGATGGG 

NO GCAGAGA CnTGGATGGG 

C 3 M (5 CK; AG ACln - - - C ATCGATGGG 

GUI GCAGArJ\ CATGGATGtXS 

-iJ : : ( 1 i GC A---^a^AA - C^TGai^TGGC 

G4 GCAA.\a^.AA-CATGGATC;GG 

RGR3 9 GCAAAa=^^^- CATGGATGGG 

L9 1 S GCAAAGAAA-CaTGGATGGG 

Rr;R3 8 GCAGAGAAA-O^TGGATGGG 

SLP3 3 GCAaAGAAA-CATGGATCa^ 

9215 512 GCAGAGA C^TGGA1X^,GG 

Kl GCAGAa\ CATGGATGGG 

9 2 1 S 5 1 3 GCAGAC^ CATGGATGGG 

Sdos GCAGAGrv CATGGATGGG 

R6 4 GO AG Aa^v CATGGATGGG 

R56 GCAGAGA CATGGATGGG 

R65 GCAGAGA CATGGATGGG 

S5 ACGGA GGACCGT 

55 ACGan GGACCGT 

Fl ACGa^v --GGACCGT 

F2 ACGGA GGACCGT 

F3 ACGGA - - - GGACCGT 

F5 ACGGA GGACCGT 

F7 ACGa^ GGACCGT 

F6 ACGGA GGACCGT 

51 ACCaA GGACCGT 

A2 ACAGATGTTTTGTGGGGGGG 

Al ACAGATGTTTTGTGGCXXXX5 

B ACAGATGTTTTGTGGGCX^GG 

AGK ACT GGAGACCGG 

AGN ACTA GGAGGCC 

AGP ACT - -GGAGGCC 

AGBO ACT- GGAcrCC 

AGBB ACT GGAGCCC 

53 ACTGG AGGCCGT 

AGJ AC GGAGGGC 

AGG AC GGAGGGC 

C5 GCAGA GGACCGT 

C8 C5CAGA GGrtCCGT 

C2 ACAGA GGACCGT 

C4 ACAC;A GGACCGT 

C6 ACAa^ GGAC-GT 

CI ACAGA aiACCGT 

C 1 1 ACGGT Ca^AGGA 

C 1 2 ACGGT TG AAGGA 

C 1 3 ACCXrr aiAAGGA 

52 ACGaA TAGTAGT 

54 ACCa^ GGACCGT 

AGL ACGGATA GT 

AG I ACGGATA GT 

AGC ACGGATA GT 

AGA ACGGATA GT 

CAG2 ACGGATA tTT 

AGT> ACGGATA GT 

AGE ACGGATA GT 

AGR ACTG TGAGGGC 

AGS ACTG TGAGGGC 

THAK2IANU CCAGGCGCCCGCCGGAGGaA 



AG- 
AG- 
AG- 
AG- 
AG- 
AG- 
CA- 
CA- 
CA- 
CA- 
CA- 
CA- 



AA- 

AA- 
AA- 
AA- 



AACTTTCATTTACC TAAA\A-TAA.iTGrA1TGGG ACCCCCATCCCCCC- -TCTG 
.---.CTTTCATTTACC TAAAAA-TAAATCTATrGGG ACCCCCATCCCCCC -- TCTG 
A-^CTTTCATTTACC TAAAAA-T.'^AATGTATTGGG .-JCCCCCATCCCCCC--TCTG 
AACTTTCATTTACC TAA-AAA-TAAATCTATrGGG ACCCCCATCCCCCC --TCTG 
A-nCTTTCATrTACT TTAAAA-TGAATG-ATTGCG ACCCCCATCCCCCC- -TCTG 
A-ACTTTCAnTACT TAAAAA-TAAATGTATTGGG AnCCCCAACCCCCC--TCrG 
AACTTTCATTTACT TTTAAA-TAAGTG- ATTGGG ACCCCTACCCCCCCC-TCTG 
AACTTTCATTTACT TTTAAA-TAAGTG- ATTGGG ACCCCTACCCCCCC--TCTG 
A.-XmTCATTTACT TTTAAA-TAAGTG- ATTGGG ACCCCTACCCCCCC--TCTG 
AACTTTCATTTACT TITAAA-TAAGTG- ATTX^ ACCCCTACCCCCCCC-TCTG 
AACTTTCATTTACT TTTAAA-TAAGTG -ATTGGG ACCCCTACCCTCCCC-TCTG 
AACTTTCATTTACT TTTAAA-TAAGTG- ATTGGG ACCCCTACCCTCCCC-TCTG 

A.^CTTrTGTTTACT TTTAGG-TanATG- ATTGGG ACCCCTACCCCCC TCTG 

AACTTTTGTTTACT TTTAGG-TGAATG- ATTGGG ACCCCTACCCCCC TCTG 

ACCmTGTTTACT TTTAGG-TGAATG -ATTGGG ACCCCTACCCCCC TCTG 

AimTTGTTTACT TTTAGG -TGAATG- ATTGGG ACCCCT--CCCCCC---TCTG 

AACTTTCATTTACT TTTAGG -TGAATG -ATTGGG ACCCCTACCCCCC TCTG 

AACTTTTGTTTACT TTTAGGGTAAATG- ATTGGG ACCCCTACCCCCCC- -TCTG 
AACrTTTGTTTACT TTTAGGGTAAATG- ATTCXXJ ACCCCTACCCCCCC --TCTG 

AA AAAA GTCTTCC G 

AAAA ---GTCTTCC- G 

;^ViiA GTCTTCC G 

AAAA GTCTTCC G 

AAAA GTCTTCC G 

AAAT GTCTTCC G 

a^j, GTCTTCC G 

AA A.AAA — -GTCTTCC G 

■ GTCTTCC G 

CTCCCCCTTGACTTCTG 

---CTCCCCCTTaACTTCTG 

CTCCCCCTTGACTTCTG 

CT CTTCA G 

GCTTTCA G 

GCCTTCA G 

- -GCCTTC?. G 

GCCTTCA G 

--GCCTTCA G 

C?rCTTCC G 

GTCTTCC G 

CTG 

CTG 

CTG 

- -CTG 

CTG 

CTG 

CTG 

CTG 

CTG 

--GTCTTC G 

C7TCTTC TG 

GAGGTCC G 

GAGGTCC G 

GAGGTCC G 

- GAGGTCC G 

--GAarrcc g 

-a-.GGTCC G 

GAGGTCC G 

GCAACTTGCC 

GCA.-.GTGCC 



AA AAAA 

AA--GGAACTTTATTGGACC TTCTA- 

AA--GGAACTTTATTGa»LCC TTC 

AA--GGAACTTTATTGGACC TTC 

TAA AAAAA 

GT AAAAA 

GT AAAAA ' 

GT AAAAA 

CT AAAAA 



-TA- 
-TA- 





TTT AATTA 

TTT AATTA 

AA AAAAA GTCT T * 

AA AAAA GTCT T 

AA "AAAA GTCT T 

AA AAAA GTCT T 

AAAA GTCT T 

AAAA GTCT T 

AAGTCTTTCTCGCG AGAGAGAGG- 
AAGTCTATCTCGCG Aa^GAGAGG- 
AAGTCTTTCTCGCG AGAGAGAGG- 

AAAAA 

AAAA 

-CCTCTCGGGGGC- 

-CCTCTCGGGGGC- 

-CCTTTCGGGGGC- -- 

-CCTTTCGGGGGC- 

-CCTTTCCX2GGGC- 

-CCTTTCGGGGGC- 

-CCTTTCGGCCGC- 



AA 

AA 



AA-- 



AA 

AA 

AGC-- 
AGC-- 
AGC-- 
AGT-- 
AGT-- 
AGC-- 
AGC-- 

AT 

AT 

CCAACCAAA- 



TCTACTC.--iirr-CTA-^- - - 
TCTACTC.-ACT-CTAa- - - 
TCTACTC----.CT-CTAA- - - 
TCTACTC.-ACT-CT.-ik- - - 

TCT ACTC .---.CT - CT An 

TCTACTC-AjCT -CTAA 

TCTACTC.-ACT-CTAi.- - - 
TCTACTC.-ACT-CTAA- - - 

TCTA.CTC.---.CT - CTAA 

TCTACrc.---.CT-CTAA 

TCTACTC.-ACT - CTAA 

TCT ACTC AACT - CTAA 

TCTACTCA-iCT-CTAA— - 

TCTACTC.---.CT-CTAA 

TCTACTC.----CT-CTAA 

TCTACTC---.CT-CTAA 

TCTACTC---.CT-CTAA 

TCTACTC----.CT-CTAA- - - 

TCTACTC-.--^T - CTAA 

TCTATT.-_---.C - - 

TCTATT.-.-AC CA- - - 

TCTATTi_-A.C CA 

TCTA7T------.C C=v 

TCTATTA.-AC CA 

TCTATTA.---X: CA'-- 

TCTATT.-.--,-jC CA 

TCTATT A.---.C CA 

TCTATT--.--ijC C\ 

TCTACTT.-ATT C\ 

TCTACTT AATT CA 

TCTACTT.-_-.TT C-\ 

TCTGCTA.-ATC CA- — 

TCTGCTA--A.TT CA 

TCTGCT.~.TT - - - CA- - - 

TCTGCT.---_-.TT CA 

TCTGCT--J..iTT- - - - 

TCTGCTA_-_-.TT CA 

TCTACTi.-AT C 

TCTATTC-AC C 

TCTACTT.-A TTTA 

TCTACTT.-A TTTA 

TCTACTT.--A TCC-.- -- 

TCTACTT A.A TTTA 

TCTACTT.--'^ TTCA 

TCTACTT.-A TCCA- - ■ 

GCTCCTTTTCCGTCC-A - ■ 
GCTCCTTTrCCGTCCA.\- ■ 
GCTCCi'i'i'iUCGTCa-A - ■ 

TCT - GTC.-AT A-- 

TCT A 

TCTC-CTAT.AC 

TCTGCTAT.-jC 

TCTGCTATAC - 

TCTGCTAT.-jC 

tctgctatac 

tctgctatac 

tctgctatac 

-cttatttac 

-cttattt.-a; 



1-1 




j9 
207 
MO 
306 
GUI 

G4 
RGR39 
L91S 
RGR38 
SLP33 
9215512 
Kl 

9215513 
Sdos 

P.65 
S5 
S(S 
Fl 
F2 
F3 
F5 
F7 
F6 
SI 
A2 

B 

AGK 

^ AGN 
■ AGP 

^ AGBO 
AGBB 
S3 

AGJ 
ACX^ 
C5 
C8 
C2 
C4 
C6 
CI 
Oil 
CI 2 
C13 
S2 
S4 
AGL 
AG I 
AGC 
AGA 
CAG2 
AGD 
AGE 
AGR 
AGS 

TK.AR2I.-J^ 



- -TATA-.-ACCCAATTTATC 
- -TATA-A.-.CCC^^ TTT ATT 

- -TATA- AACCCAATTTATT 

- -TATA- AACCCAATTTATC 

- -TATA- AACCCA-^ TTT ATT 

- - TATA- A.-.CCCAATTTATC 

- - TATA- AACCCAATTTATT 

- - TATA- AACCCAATTTATT 

- - TATA- AACCCAATTTATT 

- -TATA- AACCCAATTTATT 

- - TATA- AACCCAA TTT ATT 

- - TATA- AACCCAATTTATT 
- -TATA- AACCCAATTTATT 

- - TATA- AACCCAATTTATT 

- -TATA-AACCCAArrrATT 

- -TATA-.-ACCCAATTTATT 

- -TACA- AACCCA/^. TTTA TT 

- - TATA- AACLX AATTTATT 

- - TATA- AACCCAATTTATT 

- - CACA- AACCCCATTGTAT 
- -CACA- AACCCCATCGTAT 

- - CACA- AACCCCATTGTAT 

- -OkCA- AACCCCATTGTAT 

- -CACA- AACCCCATTGTAT 

- - CACA- AACCCCATTGTAT 

- - CACA- .-ACCCCATTGTAT 

- - CACA- AACCCC ATTCrPAT 
- - CACA- AACCCCATTGTAT 
- -TATA-AACTCAATTTATT 
— TATA- AACTCAATTTATT 
_ -TATA-AACTCAATTTATT 

- -TATACAAACCCATT-TAA 

- -TATACAAACTCATT-TAi^ 

- -TATACAAACTCATT-TAA 

- -TATACAAACTCATT-TAA 

- -TATACAAACTCATT-TAA 

- -TATACAAACTCATT-TA^i 

ACACA.-ACTCATT-TAA 

AC ACAAACTCATTGTAT 

- - CACA- A.-.CTCTATTTATT 

- - CACA- CACTCTATTTATT 

- - CACA- CACTCTATTTAAT 
- - CACA- CACTCT ATTT AAT 

- - TATA- AAATC AATTTAAT 

- - CACA- CACTCTATTTAAT 
TACATA- AAATCTTATATAT 
T ACAT A- AAATCTTATATAT 
TACATA- AAATCTTATATAT 

- - CATA- AACTCCAA TTT AA 

- -CATA- AACTCCAATTTAA 
_ - -ATA- AACTCCAG- -TAT 

ATA- AACTCCAG- -TAT 

ATA- AACTCCAA- -TAT 

ATA- AACTCCAA- -TAT 

ATA- AACTCCAA- -TAT 

ATA- AACTCCAA- -TAT 

ATA-TACICCAA- -TAT 

- -TACATACACCTAA- -TAT 

- -TACATACACCTAA- -TAT 

- -TCCKXf^ACCTrTATITCTT 



TTA--_-AT-GAATGTAA-TCG 
TF ."----AT - G AATCTAA - TOG 
Tr;^--_-AT-GAATGTAA-TGG 
TTAi-AAT-GAATGTAA-TGG 
TTA.-.-AT -GAATGTAA -TGG 
TTA.-_-AT-GA.ATGTAA- TGG 
TTA.---AT-GAATGTAA-TGG 
TT.-Ji_-AT -GAATGTAA -TCX; 
TT A.---AT - GAATGTAA - TGG 
TTA.i--AT-aAATGTAA-TGG 
TTi--_AAT-aAATGTAA-TGG 
TTA.-J-AT-GAATGTAA-TGG 
TTA.---AT-GAATGTAA -TGG 
TTA-.- J-AT -GAATGTAA -TGG 
TTAAAAT - GAATGTAA -TGG 
TTAAAAT - GAATGTAA -TGG 
TTAA.AAC -GAATGTAA -TGG 
TTAA.-.AT-GAATGTAA-TGG 
TTAA-AAT-GAATGTAA-TGG 
TTA-i-ATT-GAATCTAA -TTG 
TT AAGTT - GAATGTAA - TTG 
TTAr_ATT - GAATGTAA - TTG 
TTAA-ATT - GAATGTAA -TTG 
TTAAATT -GAATGTAA -TTG 
TTAAATT - OAATGTAA- TTG 
TTAA-ATT - GAATGTAA- TTG 
TT^>_-ATT - GAATGTAA -TTG 
TTA.iATT - GJ^ATGTAA -TTG 
Tr.-.AAAC-GGATGTAA-TGG 
n-A.-.AAC - GGATGTAA -TGG 
TTAAAAC -GGATGTAA-TGG 
TTGA--JCT- GAATGTAAATTG 
TTGi-ACT-GAATGTAC-TTG 
TTCAACT-GAATCTAC-TTG 
TTai.-CT-aAATGTAC-TTG 
TTGAACT - G AATGTAC - TTG 
CTC-CACT -GAATGTAC - TTG 
TTA.--ATT- GAATGTAA -TTG 
TTAAACT - GAATGTAA -TTG 
TTAA-AAC-GAATGTAA-TTG 
TTA.--AAT-GAATGTAA-CAG 
TTCA-ACT -GAATGTAA-TTG 
TT i_--ACT - GAATGTAA-TTG 
TTAAAAC -GAATGTAA -TGG 
TTaAACT-GAATGTAA-TTG 
CTAATCA-GAATGTAA-TCG 
TTi-ATCA-GAATGTAA-TCG 
CTA-ATCA-C^AATGTAA-TCG 
TAAATCT -GAATGTAA- TTG 
TAAATCT -GAATGTAA-TTG 
ATAAATTTGAATGTCA TTTG 
AT.----ATTTGAATGTCA TITG 
AT.-J^-ATTTGAATGTAATTTG 
ATAAATG-GAATGTA ATTTG 
ATA.--ATG -GAATGTAAITTG 
ATA_--ATrTGA?-.TGTAATTTG 
AT.i-AATTTGAATGTAATTTG 
GG.---.ATC - GCACGTGACTTG 
GG^-AA.TC-GCACGTGACTTG 
ACAGCTCTGAGCAAAAATTC 



ATGTAA-CGCATC- 

ATGTAA - CGCATC - 

ATGTAA-CGCATG- 

ATGTAA - CGCATC - 

ATGTAA-CGCATC- 

ATGTAA -CGCATC- 

ATGTAA-CGCATC- 

ATGTAA -CGCATC- 

ATGTAA-CGCATG- 

ATGTAA-CGCATC- 

ATGTAA-CGCATG - 

------ ATGTAA-CGCATC- 

ATGTAA-CCCATC- 

ATGTAA-CGCATC- 

ATGTAA-CCCATC- 

ATGTAA-CGCATC- 

ATGTAA-CGCATC- 

ATGTAA-CGCATC- 

ATGTAA-CGCATC- 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA - C ACATC - 

ATGTAA-CACATC- 

ATGTAA-CACATC- 

ATGTAA-CGCATCT 

ATGTAA -CGCATCA 

ATGTAA -CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATCA 

ATGTAA-CGCATC- 

ATGTAA-CGCATC- 

ATGTAA-CGCATC- 

ATGTAA-CGCATC- 

ATGTAA-CGCATC- 

ATGTAA-CGCATC- 

ATGTAAACGCATC- 

ATGTAAACGCATC- 

ATGTAAACGCATC- 

ATGTAA -CACATCT 

ATGTAA-CACATCT 

ATGTAA - CACATCT 

ATGTAA-CACATCT 

ATGTAA- CC»CATCT 

ATGTAA-CGGATCT 

ATGTAA-CGGATCT 

ATGTAA-CGCATCT 

ATGTAA-CGCATCT 

CGTCGTATGGCG -CTTAAAA 
CGTCGTATGGCG-CTTAAAA 
AAAA-TGAATC- 



-TAAT- -ACTAAGTTTCAAC /lACGCATCTCTTCiGCTCTCC; 
-TAAT- -ACTAAGTTTCAAC .-ACGCATCTCTTGCJCTCTCG 
-T.AAT--ACTArjGnTCAAC ^-ACGCATCTCTTGGCTCTCG 
-TAAT- -ACTAAGTTTCAAC AACGCATCTCTTGGCTCTCC 
-TAAT- -ACTAAGTTTCAAC AACGCATCTCTTGGCTCTCC 
-TAAT- -ACTAAGTTTCAAC .AACGGATCTCTTGGCTCTCG 
-TAAT- -ACTAAGTTTCAAC A.ACGCATCTCTTGGCTCICG 
-TAAT- -ACTAAGTTTCAAC AACGCATCTCTTCGCTCTCG 
-TAAT--ACT--AGTTrCAAC AACGCAT CTCTT GGCTCTCG 
-TAAT- -ACTAAGTTTCAAC AACGCATCTCTTGGCTCTCG 
-TAAT- -ACTAAGTTTCAAC AACGCATCTCTTGGCTCTCG 
- TAAT - - ACTAAGTTTCAAC AACGC ATCTCTrGGCTCTCG 
-TAAT- -ACTAAGTTTCAAC AACGCATCTCTTGGCTCTCG 
-TAAT- -ACTAAGTTTCAAC AACC-CAT CTCTTG GCTCTCG 
-TA.AT- -ACTAAGTTTCAAC AACGCATCTCTTGGCTCTCG 
-TAAT--ACTAAGTrTCAAC AACGCATCTCTTGGCTCTCG 
-TAAT- -ACTAAGTTTCAAC AACGCATCTCTTGGCTCTCG 
-TAAT- -ACTAAGTTTCAAC AACGG^.TCTCTTGGCTCTCG 
-TAAT- -ACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 
-TTAG-AACTAAGTITCAAC AACGGATCTCTTGGCTCTCG 
-TTAG-AACTAAGTTTCAAC AACGGATCTCTTGGCTCTCG 
-TTAA-AACTAAGTTTCAAC AACGGATCTCTTGGCTCTCG 
-TTAGGAACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 
-TTAG-AACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 
-TTAG-AACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 
-TTAG-AACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 
-TTAG-AACTrJCTTTCAAC AACGGATCTCTTGGCTCTCG 
-TTAG-AACTAAGTrrCAAC AACGaATCTCTTGGCTCTCG 
-TAA--TACTAAGTITCAAC AACGaATCl C i' IG GCTCTCG 
-TAA--TACTAAGTTGCAAC AACGaAT CTCTT GGCTCTCG 
-TnA--TACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 

TTA AACTAAGTTTCAAC AACGGAT VlCj-iG GCTCTCG 

TTA AACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 

TTA AACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 

TTA rACTA^GTTTCAAC AACGGATCTCTTGGCTCTCG 

'XTA AACTAAGTTTCAAC AACGGATCTCTTGGCTCTCG 

-jTAC ACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 

TTAG-- AACTAAGTTTCAAC AACGGATCTCTTGGCTCTCG 
TTAG- -AACTAAGTTTCAAC A.ACGGATCT:TTGGCTC?CG 
-TA--ATACTA.-GTTTCAAC AACGGATCTCTTGGCTCrC G 

- TA - - TTACT AAGTITC AAC AACGGATCTCTTGGCTCTCG 

- TA - - ATACTAAGTTTCAAC AACGGATCTXrrTGGCTCTCG 
_TA--ATACTAAGTITCAAC AACGaATCTCTTGGCTCTCG 
-TA- - ATACTAAGTTrCA.AC A.ACGaATCTXrTTGGCTCTCG 
-TA- - ATACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 
-TA-TAAACTAAGTTTCAAC AACGGATCTCGTGGCTCTCG 
-TA-TAAACTAAGTTTCAAC AACGGAT CTCGTG GCTCTCG 
-TA-TAAACTAACTTTCAAC AACGaATCTdTGGCrCTCG 

TTAA ACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 

-jTAA ACTAAGTTTCAAC AACGGATCTCTTGGCTCTCG 

ATA AACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 

ATA AACTAAGTTTCAAC AACGGATCTCTTGGCTCTCG 

TTA AACTA.AGTTTCAAC AACGGATCTCTTGGCTCTCG 

TTA AACTAAGTTTCAAC AACGaATCTCTTGGCTCTCG 

TTA AACTAAGmCAAC AACGaATCTCTTGGCTCTCG 

TTA AACTAAGTTTCAAC AACGGATCTCTTGGCTCTCG 

TTA- --AACTAAGTTTCAAC AACGGATCTCTTGGCTCTTG 
CTAAA- AAC - AAGTTTCAAC A.ACGaATCTCTTGGCTC:CG 
CTAAA- A^C- AAGTTTCAAC AACGGATCTCTTGGCTCTCG 

AACTTTCAAC AACGaATCTCTTGGTTCIGG 




:> CA'T\:aATCl----GAACGCiJGCG 

RGrv - CATVGATGA.iaAACCCAGCG 

y 2 (17 C ATO jATaAACLAACGCAGCG 

NO CATCtlATGAAGAACGCAGCG 

C 3 1 1 6 CATCGATaAAGAACGCAGCG 

GU: CATCaATG--AaAACGCAGCG 

AJ 1 1 n 1 C A1\:aATa--AaAACGCAGCG 

G4 CATCGATaAAGAACGC^rjCG 

RG R3 9 C ATCGATGAAGAACGC AGCG 

L9 1 S CATCOATCAAaAACGCAGCG 

RGR3 8 CATCaATa^AaAACX}CAGCG 

SLP33 CATCGATaAAGAACGCrJGCG 

9215512 CATCGATCAAGAACGC.-GCG 
Kl CATCGATai-AaAACGCAGCG 

9215513 CATCGATGAAa\ACGC AGCG 
Sdos CATCGATaAAaAACGC-JGCG 

R64 CATCGATGAAGAACGCAGCG 

R56 CATCGATaAAGAACGCAGCG 

R65 CATCGATaAAGAACGCAGCG 

S5 CATCGATaAAGAACGCAGCG 

S5 CATCGATaAAGAACGCAGCG 

Fi CATCGATaAAGAACGCAGCG 

F2 CATCGATGAAaAACGCAGCG 

F3 CATCaATaA-AGAACGCAGCG 

F5 CATCaATClAAGAACGCAGCG 

F7 CATCGATaAAGAACGCAGCG 

F6 CATCGATa-AGAACGCAGCG 

51 CATCGATC^AA.aAACGCAGCG 
A2 CATCGATaAAaAACGCAGCG 
Al CATCaATOAAAAACGCAGCG 

B CATCaATaAAGAACGCAGCG 

AGK CATCGATaAAGAACGCAGCG 

AGN CATCGATaAAGAACGCAGCG 

AGP CATCGATaAAGAACGCAGCG 

AGBO CATCGATGAAGAACGCAGCG 

AGBB CATCGATaAAGAACGC-iOCG 

53 CATCGATGAA.aAACGCAGCG 
AGJ CATCGATaAAaAACGC.^r^G 
Aa:; CATCOATaAAGAACGC.-iiCG 

C 5 C ATCGATaAA-GAACGCAGCG 

C8 CATCGATaAAGAACGCijGCG 

C2 CATCGATGA-AGAA.CGCAC^G 

C4 CATCaATCiAAGAACGCAGCG 

C6 CATCaATGAAGAACGCAGCG 

CI CATCGATaAAGAACGCAGCG 

CI 1 CATCGATaAAGAACGCAGCG 

C 1 2 CATCGATa--AGAACGCAGCG 

C 1 3 C ATCGATGAAGAACGC iJGCG 

52 CATCaATaAAGAACGCAGCG 

54 CATCGATaAAGAACGCAGCG 
AGL CATCGATaAA.GAACGCAGCG 
AG r C ATCGATaAAGAACGC.-OCG 
AGC CATCaATaAAGAACGCAGCG 
AGA CATCGATaAAGAACGC.-OCG 

CAG2 CATCGATaAAGAACGCAGCG 

AGD CATCGATai-AC^ACGCAGCG 

AGE CATCGATGAAGAACGCAGCG 

AGR CATCGATGAAGAACGCAGCG 

AGS CATCaATGAAaAACGC.-OCG 

THARZIANU CATCGATGAAGAACGCAGCG 



AA.^TC-CGATAAfjTAATGTaA 
A-AATGCaATAAGTAATCraA 
AA.:kTGCGATAAGT.AATGrrciA 
A^^^TGCaATA.-X7rAATGrGA 
AA.=iTGCaATAAGT.AATGTaA 
AAATGCaATAAGTAATGrtlA 
AAA.TGCaATAAGTAATGTa\ 
AAATGCaATAAGTAATGTaA 
A-AATGCaATAAGTAATCTCA 
AAA.TGCaATAAGTAATGTGA 
AAATGCaATAAGTAATGTGA 
AAATGCaATAAGTAATGTGA 
AA.-.TGCaAT^-W7rAATtrrGA 
AAATGCaATAAGTAATGTGA 
AAATGCGATAAGTAATGTaA 
AAATGCaATAAGTAATGTaA 
AAATGCaATAAGTAATGTGA 
AAATGCGATAAGTAATGTGA 
AAATGCGATAAGT.^ATGTGA 
A.--ATGCGATAAGT.AATGTGA 
AAATGCGATAAGTAATOTGA 
A.AATGCGATAAGTAATGTGA 
AAATGCGAT.\AGTAATGTGA 
AAATGCaATAAGTAATGTGA 
AAATGCGATAAGTAATGTaA 
AAATGCGATAAGTAATGTGA 
AAATGCaATAAGTAATGTGA 
AAATGCaATAAGTAATGTGA 
AAATGCGATAAGTAATGTGA 
AAATGCGATAAGTAATGTaA 
A.AATGCGATAAGTAATGTGA 
AAATGCGATAAGTAATGTaA 
AAATGCaATAAGTAA.TGTGA 
AA.ATGCGATAAGTAATGTaA 
AAATGCGATAAGTrATGTaA 
AAATGCGATAAGTAATGTGA 
AAATGCGATAAGTAATGTGA 
AAATGCGATAAGTAATGTGA 
AAATGCGATAAGTAATGTaA 
^AAATGCaATAAGTAATGTGA 
AAATGCGATAAGTAATGTGA 
AAATGCaATAAGTAATGTGA 
AAATGCGATAAJGTAATGTGA 
AAATGCGATAAGTAATGTGA 
AAATGCGATAAGTAATGTGA 
A-ATGCaATAAGTAATGTGA 
AAATGCGATAAGTAATGTGA 
AAAT(}CaATAAGTAATGTGA 
AAATGCGATAAGTAATGTGA 
AAATGCGATAOT'AATCTrGA 
AAATGCGATAAGTAATGTaA 
AAATGCGATAAGTAATGTGA 
AAATGCGATAAGTAATGTGA 
AAATGCaATAAGTAATGTGA 
AAACCCGATAAGTAATGTGA 
AAATGCGATAAGTAATGTaA 
AAATGCaATAAGTAATGTGA 
AAATGCaATAAGTAATGTaA 
AAATGCGATAAGTAATGTGA 
AAATGCaATAAGTAATGTGA 



ATTGCAaAATTCAGTaAATC 
ATTGCAaAATTCAOraAATC 
ATTGCAaAATTCAGTGAATC 
ATTGCAr^AATTCAGTa=ATC 
ATTGCAGAATTCAGTaAnTC 
ATTQCAGAATTCAGTGAnTC 
ATTGCAGAATTCAGTaAATC 
ATTGCAGAATTCAGTG^ATC 
ATTGCAaAATTCAGTGAnTC 
ATTGCAGAATTC-OTGAATC 
ATTGCAaAATTCAGTOAATC 
ATTCCAOAATrCAGTCAATC 
ATTC-CAaAATTCAGTOAATC 
ATTGCAaAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTaAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAaAATTCAGTOAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAaAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTaAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTaAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTC;^JGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTC^\GTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTaAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
AtTGCAGAATTCAGTGAATC 
ATTGCAGAATTCP^jGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 
ATTGCAaAATTCAGTGAATC 
ATTGCAGAATTCAGTGAATC 



ATCaAATCTTTaAACGCACC 
ATCaAATCTTTGAACGCACC 
ATCaAATCTTTaAACGCACC 
ATCaAATCTTTGAACGCACC 
ATCaAATCnTGAACGCA.ee 
ATCaAATCTTTGAACGCACC 
ATCaAA'iCl '1' IXIAACGC.-CC 
ATCaAATCTTTaAACGCACC 
ATCaAATXTTTTaAACGCACC 
ATCaAATCTTTaAACGCACC 
ATCaAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTraAACGCACC 
ATCaAATCTTTGAACGCACC 
ATCaAATCTTTOAACGCACC 
ATCGAATCTTraAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCaAATCTTTGAACGCACC 
ATCaAATCTTTGAACGCACC 
ATCGAATCTITGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCaAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTITGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCaAATCnraAACGCACC 
ATCaAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTaAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCaAATCTTTGAACGCACC 
ATCGAATCTTTG/vACGCACC 
ATCaAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCaAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCaAATCTTTaAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTITGAACGCACC 
ATCGAATCTITGAACGCACC 
ATCGAATCTTTaAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCOAATCTITaAACGCACC 
ATCGAATCTTTGAACGCACC 
ATCGAATCTTTaAACGCACC 
ATCGAATCTTTGAA-CGC-iCC 
ATCCiAATCnraAACC-CACA 



TTGCGCTC CTTGGT ATTCCT 
TTCCGCTCCTTCXTTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTCGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TrGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCITGGTATTCCT 
. TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCrCClTGGTATTCCT 
TTGCGCTCCTTGGTATrCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 

TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCITGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCJT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCTCCTTGGTATTCCT 
TTGCGCCCGCCAOTATTCTG 
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■^^^^^^ ^^^^^^^^^^^ 

si=^^ 

^= iiiii 1=^; mm 

=11 1=^ 

TGGAGCATCCCrorrrGAGT ATCATGAAATC-TOA^CT 4T?Sm---a^C^ TnTTGaAGCrT-TXTrrCCAGC 

TCGAGCATCCCTGTrrGAGT ATCATCAAATC-TOA.^GT -^ZS^TE^ ^^TGCTTT-- -C^ TATOXiAGCTT-TCrTGC.^^ 

TGGAGCATCCCTGTrrGAGT ATCATGAAATC-T^AA^CT ^lE^IE^ ^^JS^m-- - -CA^^ TATTGaA^GCTT-'ICTrGCAGC 

TGa^GCATCCCTGTrrGAGT ATCATGAAATX:-TTCAA.^GT ^I^^IJIS^ ^^?5?iTT-- -S^C^ TATrGGAGCT-'ICTrGLVrJC 

TGGAGCATClCCIXrnTGAGT --^'ATGAAATC-TTC.nA^CT ^TSJSJSStA^J^tA - "^^^:^'CG ATrTGa=.GG--'mTCCAa^ 

TCGAGCATCCCICTTTGAGT ^TCATCAAATT-CTCAAAGT '^ZSZflSI^?^^ ^TCCT-T- - - -TGOTT-GG ATTTGGAGG- -TTTTGCAGA 
W^GCATCCCIOTrrGAGT -'^'TCATCAAATT^-CTCAA^CT -^^A^^i^S^^??^^ Jl^-T--- -TGCTTr-GG AriTGaAGG--TrrrGCA.^.n 

Tcr^GCATGCCTCrrrGAGT -•i^AT?*^^!!-^^^?^ '^t^^J^SS?^™^ c?GGr-T----TCcrrr-GG ATITGGAGG--TrrrGCAa^ 

TCIGAGCATCCCIXTTTTGAG-T -'TCATGAAATT-CTCAAA^ '^t^SZIf^^I^ S^-T- -- -TGCTIT-GG ATTTCanGG- - TTTTGC AG.^ 

TCGAtXATtxcnrrrrGAGT ATCATaAAAi^-CTCA^GT -AA^I^IZIESZ^ SSCT-T----'rccrrr-GG A'rrrGGAGG--TrrrGCAAA 

TCX^GCATrKrCTGTITGAGT ATCATGAAAT^^AAACT ^TCCT-T- ---TCCT^ ATTTCGAGG- -TrrrGCA.A.=. 

TGGAGCATGCCTGTITCAGT -^ATGAAATpCTC^Gr --^^JSZ^^^ CTGOT-T----TCCTrr-GG ATITGa:^GG--TrrrGCAA.=. 

TGGAGCATCCCrGrrTCAGT ATC ATtSAAATT- CTCAAACT --^1121^5^ S^-T-.-TCCrir-GG ATTTGCAGG- - TTrTCC Aa^ 

T<r^C^CATCCCIXrrrrGAGT ^TCATanAATT-CTCA^CT SgScTT---1OT^-GG ATTrGGAGGCCT-TTGCAa^. 

TGGAC^ATGCCTGTrrGAGT r^S^^^S^ ?^§Sr^---TGC^ TATrGGAGa=vTTATrGC.^ 

TGa^GCATGCCTGTrrGAGT ATCATCAn^TC-TTCAAAGT ?-^^EZ^^ TTCGr-TC" -TOrrTT-GG TATrGGAGGATTATTGCAGC 

Tr^GC^TGCCTGrrTGAGT ATCAT^TX.-TTCA^GT SSSct^-T-I^C^ ATTTCGAGGCCT-TTGCAa^ 

TCGAGCATCCCICTTTGAGT ^TCATCAAAIT- CTCAP^CT ^^^-51^!^:?^^?^^^ S^SSt- - --gS^T ATrrGGAGGICTT- -GCAG.^ 

TGGAGC^TGCCrOTITGAGT ATCATGAAATT-CTCAA^CT -^^'^ZI^^ SSScTT- - - -GCTTT-GG ATrrGGAGGTCTr-GC-^- 

TGGAGC^TGCCTGTTrGAGT -TCATGAAATT- CTC AA^CT ^f/SH^Zg^^Z^^ SgSSt-- - -GCTTT-GG AnTGGAGGrrCTT-GCAa^ 

TCGAGCATGCCTGTrrGAGT ATCATGAAATT- CR^^^OT TTGGCcS- - -GCT^ ArTTGG^.GCTreTr- -GCAa-. 

TGGAGCATCCCrGTITGAGT ATCATGAAITT-CTCAAAGT ^^-SZ^SJ^^ ?SS^^l3---TTl.'rrriGG ACTOXX^GG- --ICTTGC^Xl^ 

TGGAGCATGCCrorrrGAGT ATCATGA-^ATT-CTCAAAGr -^^EZEE!^ ^^l^—~GCrrr-GG ACITGaAGCTCT-'riGC.^^. 

TGGAGCATCCcrGTrrGAGT ATCATGAAArr-CTCAAACT ^^^'^Z^Jg^^^ crGSTTT----GC^ ACrrcaAGGTCT-rrGC-^a^. 

TCGAGCATGCCTG-TTTGAGT ATCATGAAATT-CTCAAAGT ^TISJ^^C^to^^ tJcCT-TCC-TTA^-GG TArrGaAGCTATATTO-VGC 

^-^Fi-is -E^SiS 
^^^S^S ^-^g-iSi -^^sSiS J^^^::^^. 

CGGAGC^CGCCTGrrraAGT ATCAlGAP^TT-CTCAA^a: S5g?S---TTCC^ ATTrGGAGGICT-TGCACA 

CGGAGC^CGCCTGTTrGAGT ATCATCAAATT-CTCAAAO, "^SSSZH^^ ?rGST---'riS^ ATTrGGAGCTCT-TGC^^lA. 

CGaAGC=.CGCCrGTrrGAGT ATCAIGAAATT-CTCAAAG^ '^^SJS^^^S^A^l^S^ S^S-T- - - -TTATriTCG ACTTGGGGG— TCT-GC^JG-A 

TGGAGCATGCCTGTrrGAGT ATCATGAAATT-ATCA^CT '^Z^IZZ^ TT^T-T----'rr^rriTCG ACTrGGGGG-ICr-GCAOA. 

TGGAGCATCCCTGTrrGAGT ATCATGAAATT-ATCA^^GT "t^^^ZIZ^ ^S^^TTT- - - "ItTITCG ACTrGaAGCTTCr- -GC^r.^ 
. TGCAGa^TCCCrGTrrGAO-T ATCATGAAATC-CTCAAAAT ^^^-CTT^TOT^ S^T^TCG ACrTGaAGCrCT— GC.^A 

TGGAGCATGCCTGTrrGAGT ATCATCA.^.^TC-CTCAA^ ^rSS^SS^tSSS SS^T^TT ATTTTCG ACTrGGAGGTrCT— GCAG.^ 

: TCGAGCATCCCTGTnr^GT ATCATGAAATC-CTCAAAAT -^^"S^IEE?^ ^TC^tSt-- - -Al^TT-GG ACTTGaAGG-TCT— GC.^. 

. TGGAGCATCCCTGrrrGAGT ATCATCAAATC-CIC^^T ^I-gZ^SI^t^^^g; ^^S^TT^T-- ---ATTTrGG ACTTGGAGGTCT- -GCAG.A 

; TGaAGCATGCCTGTTrGAGT ATCATa^AATC-CTC^T ^Z^'EZZ^S?^ ^T^TTTT ATTTIGG ACTTCGAGGnCT— GCAG-A 

) TC^G.^GCATGCCTG-rrrGAGT ATCATGAAATC-CTCA^T ^I^-CTTTOT^ ??5StSt ATrTTCG ACITGGAGGTCT— GC^IIA 

: TGGAC^CATGCCTGTTrGAGT ATCATGAAATC-CTCAm^^ JJtoStJ TT-GG ACTCTGCTCTrCT COT 

TCXlAGCACGCCTGTTTGACTr ATCATGAAACT-CTCAAGCT ^^^^SEZZ StStTA -TT-GG ACTCTGCTGrCT OCT 

'^^^^a:^^^ ^SSSSS^ SSCGGGA^CCCX.. GACGGa^^XCGGCCCCG.. 



L : : 6 TT-CACACX:C-TGCTCCTCT 
TT-CACACGC-TGCTCCTCT 

92tl7 TT-CACACGC-TGCTCCTCT 

NO TT-CACACCJC-TGCTCCTCT 

C'3(16 TT-CACACGC-TGCTCCTCT 

GUI TT-CAC^CGC-TGCTCCTCT 

PJl : ti: TT-CACACGC-TGCTCCTCT 

G4 TT-CAC^CGC-TGCTCCTCT 

RGR39 TT-CACACGC-TGCTCCrCT 

L9 1 S TT-CACACGC-TGCTCCTCT 

RGR3 a TT-CACACGC -TGCTCCTCT 

SLP3 3 TT-CACACGC-TGCTCCTCT 

9215512 rr-CACACGC-TGCTCCrCT 
Kl TT-CACACGC-TGCTCCTCT 

9215513 TT-CAC^CGC-TGCTCCTCT 
Sdos TT-CAC\CGC-TGCTCCrCT 

R6 4 TT-CACACGC -TGCTCCTCT 

R56 TT-CACACGC-TGCTCCTCT 

R65 TT-CACACGC-TGCTCCTCT 

55 TTTCACAGTC-TGCTCCTCT 

56 TTTCACAGTC-TGCTCCTCT 
Fl TTTCACAGTC-TGCTCCTCT 
F2 TTTCACAGTC-TGCTCCTCT 
F3 TTTCAOVOTC -TGCTCCTCT 
F5 TTTCACAGTC-TGCTCCTCT 
F7 TTTCACAGTC-TGCTCCTCT 
F6 TTTCACAGTC-TGCTCCTCT 

51 TrrCACAGTC -TGCTCCTCT 
A2 TGTCACAGTC-TTXrrCCTCT 
Al T-TCACAC-C-TGCTCCTCT 

B T-TCAC-ACC -TGCTCCTCT 

AGK TGTCAC^GTC -TGCTCCTCT 

AGN TGTCACAGTC -TGCTCCTCT 

AGP TCTTCACAGTC-TGCTCCTCT 

AGBO TGTCACAGTC -TGCTCCTCT 

AGBB TCTTCACAGTC -TC^CTCCTCT 

53 TGTCACAGTC -TGCTCCTCT 
AGJ TTTCAC-GTC-TGCTCCTCT 
ACr; TTTC AC - CTTC -TGCTCCTCT 

C5 TTTC-ACACCC-TCXTCCTCT 

C8 TTTCACACCC-TGCTCCTCT 

C2 TT-CACAAC- -TGCTCCTCT 

C4 TTTCACACCC -TGCTCCTCT 

C6 TT-CACACC- -TGCTCCTCT 

CI TT-CAC-CTC-TGCTCCTCT 

Cll TT-CAC-CTTC-TGCTCCTCT 

C12 TT-CAC- GTC -TGCTCCTCT 

C 1 3 TT-CAC- GTC -TGCTCCTCT 

52 T-TCAC-GTC-TGCTCCCCT 

54 T-TCAC-GTC-TGCTCCCCT 
AGL T-TCAC-GTC -TGCTCCTCT 
AGI T-TCAC-GTC -TGCTCCTCT 
AGC T-TCAC-GTC -TGCTCCTCT 
AGA T-TCAC- GTC -TGCTCCTCT 

CAG2 T-TCAC- GTC -TGCTCCTCT 

AGD T-TCAC- GTC -TCCTCCTCT 

AGE T-TCAC-CTC -TGCTCCTCT 

AGR TC7TC-TGG AAC - GGCTGGTCT 

AGS TGTCrrGG AAC - GGCTGGTCT 

THARZIANU ATACAGTGGCGGTCTCGCCG 



TTGTTC A - TT AGCTGGATCT 
TTGTTCA -TTAGCTGGATCT 
TTGTTCA -TTAGCTGGATCT 
TTC/rTCA -TTAGCTGGATCT 
TKTTTCA -TT.-GCTGGATCT 
TTGTTCA-TTAGCTGGATCT 
TTGTTCA-TT.^y^CTGCnTCT 
TTGTTCA-TTAGCTGGATCT 
TTGTTCA-TTAGCTGaVTCT 
TTGTTCA - TTAGCTGGATCT 
TTGTTCA-TTAGCTGGATCT 
TTCITC A - TTAGCTGGATCT 
TTGTTCA-TTAGCTGGATCT 
TTGTTCA-TTAGCTGGATCT 
rrCTTCA -TTAGCTGGATCT 
TTGTTCA-TTiXSCTGGATCT 
TTGTTCA-TTAGCTGGATCr 
TTGTTCA -TTAGCTGGATCT 
TTGTTCA - TTAGCTGGATCT 
TAAATAAATTAGCTGGATCT 
TAAATAAATTAGCTGGATCT 
TAAATAAATTAGCTGGATCT 
TAAATAAATTAGCTGGATCT 
TAAATAAATTiXSCTGGATCT 
TAAATAAATTAGCTGGATCT 
TAAATAAATTAGCTGGATCT 
TAAATAAATTAGCTGGATCT 
TAAATAAATTAGCTGGATCT 
TAAATAAATTAGCKXiATCT 
TTG -TGCATTAGCTGGATCT 
TTC - TGCATTAGCTGGATCT 
TAAATAAATTAGCTGGATCT 
TAAATCAATTAGCTGGATCT 
TAAATGAATT AGCTGGATCT 
TAAATATATTAGCTGGATCT 
TAAATATATTAGCTCGATCT 
TAAATATATTAGCTGGATCT 
TAAATCCATTAGCTGGATCT 
TAAATGCATTAGCTGGATCT 
TT- GTTC ATTAGCTGGATCT 
TT- GTACATTAGCTGGATCT 
TT-GTrcATTAGCTGGATCT 
TT- GTTC ATTAGCTGGATCT 
TT- GTGCATTAGCTGGATCT 
TTTGTTCATTAGCTGGATCT 
TAAATGCATTAGCTGGATCT 
TAAATGCATTAGCTGGATCT 
TAAATGCATTAGCTGGATCT 
TAAATTCATTAGCTGGATCA 
TAAATTCATTAGCTGGATCA 
TAAATTCATT AGCTGGATCT 
TAAATTCATTAGCTGGATCT 
TAAATGTATTAGCTGGATCT 
TAAATGTATTAGCTGGATCT 
TAAATGTATTGGCTGGATCT 
TAAATGTATTAGCTGGATCT 
TAAATGTATTAGCTGGATCT 
CAAATGTATTAGCTGGTCCT 
CATATGTATTAGCTGGTCCT 
CA- - -GCCTCTCCTC- --CG 



CAGTGTTATGCrTGGTTC - C 
CA.GTGTT ATGCTTGGTTC - C 
CAGTGTTATGCTTGGTTC -C 
CAGTGTTATGCTTG(7rTC -C 
CM rrGTTATGCTTGGTTC - C 
CAGTGTTATGCTTGGTTC - C 
G^CTGTTATGCTTGGTrc -C 
CAGTGTTATCJCTTGGTTC -C 
CAGTGTTATGCTTGGTTC -C 
C^iGTGTTATGCTTGGTTC - C 
CAGTGTTATGCTTGGTTC - C 
CAGTGTTATGCTTGGTTC - C 
OVGTGTTATGCTTCGTTC -C 
CAGTGTTATGCTTGGTTC-C 
CA-TGTTATGCTTGGTTC-C 
CAGTGTTATGCTrGGTTC - C 
CAGTGTTATGCTTGGTTC-C 
CAGTGTTATGCTTGGTTC -C 
CAGTGTTATGCTTCGTTC -C 
CAGTAT - ATGCTTGGTTC -C 
CAGTAT-ATGCTTGGTTC-C 
CAGTAT- ATGCTTGGTTC -C 
CAGTAT- ATGCTTGGTTC -C 
C\GTAT-ATGCTTGGTrc-C 
CAGTAT-ATGCTTGGTTC--C 
CAGTAT- ATGCTTGGTTC -C 
CAGTAT- ATGCTTGGTTC -C 
CAGTAT- ATGCTTGGTTC -C 
CAGTAT-ACACTTGGTTC-C 
CAGTGTTATGdTGGTTC -C 
CAGTGTTATGCTTGGTTC-C 
CAGTATAC - ACTTGGTTC-C 
CAGTATAC- ACTTGGTTC - C 
CAGTATAC - ACrTGGTTC -C 
CAGTATAC - ACTTGGTTC - C 
CAGTATAC - ACTTGGTTC' -C 
CAGTAT- ACACTTGGTrc -C 
CAGTATAT-GCTTGGTTC -C 
CAGTATAT- GCTTGGTTC-C 
CAGTGTTATGCTTGGTrc-C 
CGGTGTTATGCTTGGTTC -C 
CAGTGTTATGCTTGGTTC -C 
CGGTGTTATGCTTGGTTC-C 
CAGTGTTATGCTTGGTTC-C 
CAGTATAAAACTCGGTTC-C 
C- -TATAAAAC-CGGTTC-C 
C- -TATAAAAC-CGGTTC-C 
C- -TATAAAAC-CGGTTC-C 
CTGTAG-A- -CTCGGTTC-C 
CTGTAG-A — CTCGGTTC-C 
GTATGAA- - -CTCGGTTC-C 

GTATGAA CrC"GGTTC - C 

CAGTGAA CTCGGTTC-C 

CAGTGAA CTCGGTTC - C 

CAGTC AA- - -CTCGGTTC - C 
CAGTGAA- - -CTCGGTTC -C 

CAGTGAA CTCGGTTC -C 

TGTTGTGGG ATTTGGITC -T 
TGTTGTGGGATTTGGTTC -T 
CAGTAGTTTCjCACAACTCCX 



ACTC-GGCGTGATAAATTATC TATCGCTCA-Ga-.CACCT- - 
AjCTCGGCGTGATAAATTATC TATCGCTCA-GCACACCC- - 
ACTCGGCGTGAT.VuATTATC TATCGCTGA-C^i.CACCT- - 
ACTCGGCGTCATAA.-.TrATC TATCGCTCA-Ga^CACCT- - 
ACTCGGCGTCATA.i»ATTATC TATCGCTCA-Ga^CACCC- - 
ACTCGGCGTGATAAATTATC TATCGCTGA-Ga^vC-CCT- - 
ACTCGGCGTa'^TA^JkrrATC TATCGCTGA-GCACACCC- - 
ACTCGGCGTCATAAATTATC TATC'GCTGA-Ga^CACCC- - 
ACTCGGCGTGATAAATTATC TATCGCTGA-CKiACACCC- - 
ACTCGGCGTGATAAATTATC TATCGCTGA-GGACACCC- - 
ACTCGGCCTGATAA-ATTATC TATCGCTaA-Ga^^CACCC- - 
ACTCGGCGTGATAAi.TTATC TATCGCTGA-Ca^CACCC- - 
ACTCGGCGTGATAAATTATC TATCGCTCA- GG.=.CACCCC - 
ACTCGGCXTTGATAAATTATC TATCGCTaA-Ga=.CACCCC- 
ACTCGGCGTGATAAATTATC TATCGCTGA-GG.-.C.-CCCC- 
ACTCGGCGTGATAA.-.TTATC TATCGCTGA-Ga-.CACCCC- 
ACTCGGCGTGATAAATTATC TATCGCTCA- Ga^C ACCCC - 
ACTCGGCGTGATAAATTATC TATCGCCa\-CiGr.C.nCCCC- 
ACTCGGCGTGATAAATTATC TATCGCCGA- GO^C ACCCC - 
ACTCGCX-GTCATAAGT- ATC ACTCGCTC,A- GCAC AC - -T- 
ACTCGGCGTCATAAGT- ATC ACTCGCTGA- Ca^CAC - - T - 
ACTCGGCGTCATAAGT-ATC ACTCGCTGAAGGACAC- -T- 
ACTCGGCGTGATAAGT-ATC ACTCGCTCA-GGACAC- -T- 
ACTCGGCGTCATAACrr- ATC ACTCGCTGA- Ga^CAC - - T- 
ACTCGGCGTGATAACr- ATC ACTCGCTGA- Ga^CAC - - T - 
ACTCGGCGTGATAAGT- ATC ACTCGCTGA- GanCAC - -T- 
ACTCGGCGTGATAAGT- ATC ACTCGCTGA- Ga^CAC - -T - 
ACTCGGCGTGATAAGT- ATC ACTCGCTa=v- Ga^CAC - -T- 
ACTCGGCGTGATAAGT- ATC ACTCGCTGA- Ga-.CAC - - T- 
ACTCAGCGTGATA.AGTTATC TATCGCTXiA-GG.-.CACCCT- 
ACTCAGCGTGATAAGTTATC TATCGCTCA- GGACACCCT- 
ACTCGGCGTGATAAG-TATC ACTCGCTGA-GG.-.CAC- -T- 
ACTCGGCGTGATAAG-TATC ACTCGCTGA- GC-CACTCTT- 
ACTCGGCGTGATAAG-TATC ACTCGCTGA- GG.-CACTGT- 
ACTCGGCGTGATAAG-TATC ACTCGCTGA-GaiC.-CTGT- 
ACTCGGCGTCATAAG-TATC ACTCGCTGA- Ga-.CACTGT- 
ACTCGGCGTCATAAGT- ATC ACTCGCTGA- GQ-CAC - -T- 
ACTCGGCGTCATAAG - TATC ACTCGCTGA- Ga-.CACCTT- 
ACTCGGCGTGATAAG-TATC ACTCGCTa=v-GGCC.-JCTGT- 
ACTCGGCGTCATAAGITATC TATCGCTCA- GQ-.C AC - - T - 
ACTCGGCGTCATAAGTTATC TATCGCTCA- Ca^CAC - -T - 
ACTCGGCGTGATAAGT- ATC TATCGCTaA-Ga-.C.iC--T- 
ACTCGGCGTGATAAGTTATC TATCGCTGAAGaACAC--T- 

ACTCAGCGTCATAAGITATC TATCGCTCA- Ga^.C AC 

ACTCGGCGTCATAAGT-ATC ACTCGCTGA- Ca^CACTCT - 
ACTCGGCGTGATAAGT- ATC ACTCGCTGA- Ga^CACTCT- 
ACTC-GGCGTCATAAGT-ATC ACTCGCTGA- Ga^CACTCT- 
ACTCGGCGTGATAAGT-ATC ACTCGCTGA- GG^CACTCT- 
ACTCGGCGTGATAACT - ATC ACTCGCTGA- GC»AC AC - - T - 
ACTCGGCGTGATAAGT -ATC ACTCGCTGA- GGnCAC - -T - 
ACTCGGCGTGATAAG-TATC ACTCGCTGA-Ga-.C--JCTGT- 
ACTCGGCGTGATAAG-TATC ACTCGCTGA- GC^C.-CTGr- 
ACTCX3GCCTGATAAG-TATC ACTCCCTCA-GaivCACTGT- 
ACTCGGCGTGATAAG-TATC ACTCGCTGA- GO^CACTGT - 
ACTCGGCGTCATA.AG-TATC ACTCGCTCL\-Ga-.CiCTGT- 
ACTCGGCGTGATAAG - TATC ACTCGCTGA- GG.-.C ACTXTT - 
ACTCGGCGTGATAAG-TATC ACTCGCTGA-Ga^CACTGT- 
ACTCAGCXTTCATAATCTATG AC -CGCTGA-Ga-.CATCTT- 
ACTCi GCGTCATA.ATCTATG AC -CGCTa\- GCACATCTT- 
ACCGGGAGCGCGGCGC - GTC CACGTCCGTAAiACACCCAA 
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69 
207 

3115 
GUI 
AJilOl 
G4 
RGR3 9 
L91S 
RGR38 
SLP33 
92155:2 
Kl 

9215513 
Sdos 
P. 6 4 
R5iS 
R65 
S5 

Fi 
F2 
F3 
F5 
F7 
F6 
SI 
A2 

B 

AGK 
AGN 
AGP 
AGBO 
AGBB 
S3 
AGJ 
AGG 
C5 
C8 
C2 
C4 
C6 
CI 
Cll 
CI 2 
CI 3 
S2 
S4 
AGL 
AG I 
AGO 

AaA 

CAG2 
AGD 
AGE 
AGR 

At;s 

TR-\HZIA>nj 



GATAAAAAG - GG - -TG(5CCA 
GATC-; AAAAG - GT - -TGGCC A 
GATA.\A.---.G-t7r- -TGGCCA 
G ATAAAA-^f ; - GG - - TGGCCA 
a^TA.AA=unG-GT- -TG^JTCA 
GATAAAAAG -CTT- -TGGCCA 
GGTrJ^A-.A^-C-G- -TGGCCA 
GGTAA.^-\.AA-C-G- -TGGCCA 
GGTAAA.^An-GG- -TGGCCA 
GGTAAAA.A\-GG- -TGGCCA 
GGTAAAAAA-GG- -TGGCCA 
GGTAAAAA^.-OG- -TGGCCA 
GGTAAAAA.^ - GGG - TGGCCA 
GGTAAAAAA - CiGG - TGGCCA 
GGTAAAAAA-GGG-TGGCCA 
GGTAAAAAA-GG--TGGCCA 
GGTAAAAAA-GGGGTGGCCA 
aTTAAAA-iA - GGG - TGGCCA 
C^CJTAAAA-Ai. - -TGGCCA 

-GTAAAA GG-TGGCCA 

-GTA."-^A- - -GG - -TGGCCA 

-GTAAAA GG- -TGGCCA 

- G-TAA-A^v - - -GG - -TtXXCA 
-GTAAAA-- -C^G- -TGGCCA 
- GT--JvA.iv - - - GG - -TGGCCA 
-GTAAAA- - -GG- -TGGCCA 

-gtaaa;^.-- -cx:j--tggcca 
- gtaaaa --- gg - -tggcc a 

GTTAAAAA- -GG- -TGGCCA 
GTTAAA.\AGGGG- -TGGCCA 
GTTAAAAAGGGG- -TGGCCA 
GT-TAAAAA- - GG - -TGGCCA 
- - - AAAA- - - GG - -TGGCCA 

AAAA- - - GG - -TGGCCA 

AAAA---GG--TGC^CA 

AAAA-- -GG- -TGGCCA 

-GTAAAAA.i - GG- -TGC^CCA 

AA.-AA G- -TGGCCA 

AAAA - - - CX: - -TGGCCA 

TGTAAAAT- - GG - -TGGCCA 
GTTAAAAAT - GG - -TGGCCA 
TGTAAAAA- - GG - -TGGCCA 
TGTAAAAAT- GG - -TGGCCA 
TGTAAAA - - - GG - -TGGCCA 
TGTAAAAG- -GG- -TGGCCA 
TGCAAAAG- - GG - -TGCCCG 
TGCAAAAG- -GG- -TGGCCG 
TGCAAAAG- - C-G- -TQOZCG 
QGrT.\AA,^J^^ - AG - -TGGCC - 
CGTAAA^A-i - AG - -TGC^C - 

AAAA- - - GG - -TGGCC - 

AAAA- - - GG - -TGC>CC - 

AAAA- - - ex; - -TGGCC - 

AAAA---GG--TGGCC- 

-GG--TGGCC- 
-GG- -TGGCC - 

AAAA GG - -TC]GCC- 

TCC>GGATCiGCCA 

TCCjC^GATGGCCA 

CTTXTC:iA.AiTGT- -TGACCT 



AGGTA.-JvTGC — agat- - - ACCGCTCCTAATA-GTCC 

AGGT.A-J^.TGC - - - .-£iAT- - - ACCGCrCCTAATA -GTCC 

AGGT/^AA-TGC - - - iXl^T- GA - - ACCGCTCCT iJ^TA-CTCC 

AGGT--_-_-.TGC ija^T- GA - - ACCGCTTCTAATA -GTCC 

AGGTA,=u-.TGC AO^T- GA - - ACCGCTTCTAATA- GTCC 

AC>GT--AATGC - - - AGAT- Cn - - ACCGCTCCTAATA-GTCC 

AGGTA.--.\TGC AG.-.T- GA - - ACCGCTTCTAATA -GTCC 

aggt;^a-^tgc - - - .-nAT- aA - - ACCGcrrcTAATA-trrcc 

AGGT.^-A.TGC Aa^vT-GA --ACCGCTTCTAATA -GTCC 

AGGTAA--.TGC AGAT- GA - - ACCGCTTCTAATA -GTCC 

AGGTA.--ATGC - - - AGAT-a^ - - ACCGCTTCTAATA -GTCC 

AGGT.AA.ATGC AGAT- - - ACCGCTTCTAATA-CTCC 

AGGT iAu^TAC - - - AGAT- GA - - ACCGCFrCTAATA-GTCC 
AGGTA^-ATAC - - - AGAT- an - - ACCGCTTCTAATA -GTCC 

AGGTAAATAC Aa^T-GA --ACCGCTTCTAATA -GTCC 

AGGTAAATAC AT- GA - - ACCGCTTCTAATA- GTCC 

AGGTA.-A.TGC - - - AAAT- GA - - ACCGCTTCTAATA -GTCC 

AGGTAAATGC Aa^T-aA - - ACCGCTTCTAATA -GTCC 

AGGTAAATGC AGAT- GA - - ACCGCTTCTAATA-GTCC 

- -GGAA.i TGC - - - AGAT- GA - - ACCGCTTCTAATA-GTCC 

- -GGA.ikATGC .^kT- Gn - - ACCCKTrTCTAATA-GTCC 

- -GGAAATGC- - - AAAT- AA - - ACCGCTTCTAATA-GTCC 

- - GOA-J^TGC AGAT- GA - - ACCGCTTCTAATA-GTCC 

- -Ga^AATCC - - - AGAT- GA - - ACCGCTTCTAATA-GTCC 

--GG-i^ATGC .-J3AT-GA --ACCGCTTCTAATA-GTCC 

--GGAAATGC AGAT-GA --ACCGCTTCTAATA-GTCC 

--GGAAATGC AGAT-GA --ACCGCTTCTAATA-GTCC 

--GGAAATGC AGAT-GA - - ACCGCITCT AATAAGTCC 

GG- - CA-ATGC rJGAT- GA - - ACCGCTTCTAP'T A- GTCC 

AGGTAAATGC - - - AGAT-GA - - ACCGCTTCTAATA-GTCC 
AGGTAAATGC - - - .-jGAT-GA - - ACCGCTTCTAATA -GTCC 

- -GGC---ATGC- - - AGAT-GA - - ACCGCTTCTAATA-GTCC 

- -GGCAATGC - - - AGAT-GA - - ACCGCTTCTAATA-GTCC 

- -GGCAATGC AGAT-GA - - ACCGCTTCTAATA-GTCC 

- -GGCAATGC - - - AGAT-GA - - ACCGCTTCTAATA-GTCC 

- -GGCAATGC - - - AGAT- CxA - - ACCGCTTCTAATA-GTCC 

- -GGCAATACT- - AaAT- - - ACCGCTTCTAATA-GTCC 

- -GGAAATAC - - - .-jGAT- GA - - ACCGCTTCTAATA -GTCT 

- -GCAAATAC - - - AGAT- CA C- ACCGCTTCTAATA -GTCT 

AGGTAS-J^CGCA GAT- GA - - ACCGCmTTAACA-GTCC 

AGGTA-i-ACACA- -TGATTGA - - ACCGCTTCTAACG -GTCC 
AGGTAAACGCA - - -CAT- GA - - ACCGCTTCTAACCAGTCC 
AGGTAAACGCA- - TGATTGA - - ACCGCTTCCAACG -GTCC 

AGGTAAATGCA CAT-GA - - ACCGCTTCTAATA-GTCC 

AGATTCATGCA- - -GAT-GA - - ACCGCTTCTAACG -GTCT 
-GATTCATGCA CGA - - ACCCXTTCTAACG-GTCT 

- GATTCATX^^A CGA - - ACCGCTTCCAACG -GTCT 

-GATTCATGC^v TGA - - ACCGCITCTA-ACG -GTCT 

- -GGGATTACT- - AG A - - GA - - ACTGCTTCTAAT A -GTCC 

- -GGGATTACT- - AGA- -GA - - ACTGCTTCTAAT A -GTCC 

- -GGCATTATT AT- GA — ACCGCTTCTAATA-GTCC 

- -GGCATItlAT TAT- r»A - - ACCGCTTCTAATA-GTCC 

- -GGGGTTGC - - - AGAC- CA - - ACCGCTTCTAATC -GTCC 

- -GGGGTTGC - - - .-riAC-CA - - ACCGCTTCTAATC -GTCC 

- -GGGGTTGC AGAC-GA - - ACCGCTTCTAATC -CTTCC 

- -GGGGTTGC - - - ACAC- C>A - - ACCGCTTCTAATC -GTCC 

--GGGGTTGC AGAC-GA —ACCGCTTCTAATC -GTCC 

--AGCC-TGCCT TGGG CTGCTTCTAA CTC 

--AGCC-CGCCT TGGG CTGCTTCTAA CTC 

CGCATCAGGT AGGAA TACCCGCTGAACTTA - - AGC 



ATTGGACTTGG - AC.-.AC.AA- 
ATTCjGACTTGG - AC.-AC.-.A- 

ATTG- ACTTGG - ACAA 

ATTG - ACTTGG - AC-A 

ATTG - ACGTGG - AC AA. 

ATTG- ACTTGG - ACAACA.A - 

ATTC - ACTTGG - AC A-A 

ATTC - ACTTC iG - AC A A 

ATTC - ACTTGG - AC AA 

ATTC - ACTTGGGAC AA. 

ATTC - ACTTGG - AC AA 

ATTC - ACTTGGCAC AA 

ATGG - ACTTGG - AC A-A 

ATTG - ACTTGG - AC A.A 

ATGG - ACTTGG - AC AT 

ATTG - ACTTGG - AC A,A 

ATTG- ACTTGG - ACAA 

ATTG- ACTTGG- ACAA 

ATTG - ACTTGG - ACAA 

ATT A - AATTGG - AC AAA 

ATTA-AATTGG-ACAA-A 

ATTA- AATTGG - AC A.--A 

ATTA- AATTGG - AC AAJv 

ATTA- AATTGG - ACAAA 

ATTA- AATTGG - ACAAA 

ATTA- AATTGG - AC AA_A 

ATTA- AATTGG - ACAAA 

ATTA - AATTGG - ACAAA 

ATTGaA CTTG G - AC A-AT - - 
ATTGGA.CrTGG - ACAATA- - 
ATTG- ACTTGG - ACAATA- - 

ATTG - ACTTGG - ACAAA 

ATTC- ACTTGG - AC AATT- - 
ATTC - ACTTGG - AC AATT- - 
ATTC - ACTTGG - AC AATT- - 
ATTC - ACTTGG - ACAATT- - 

ATTC - ACTTGG - ACAA 

ATTA- AGTTAG- ACAATT- - 
ATTA-AGTTAG-ACAATTT- 
ATTG- ACTTGG - AC AATA- - 
ATTG - A.CTTGG - AC AAGA- - 
ATTG- ACTTGG -ACAAT A- - 
ATTG- ACTTGG -ACAATA- - 
ATTG-ACTTGG-ACAACA- - 
ATTA- AATTAG - AC AAAA- - 

ATTA-CATTAG - ACAAA 

ATTA - G ATTAG - ACAAA 

ATTA- GATTAG - AC AA.A 

ATTA-GCTTGG-ACAA 

ATTA- GCTTGG - ACAA 

ATTG- ACTTGG - ACA-AC- - 
ATTG - ACTTGG - AC AAAC - - 
ATTG- ACTTGG -ACAATT- - 
ATTG- ACTTGG - ACAATT- - 
ATTG- ACTTGG -ACAATA- - 
ATTG - ACTTGG - ACAATT- - 
ATTG- ACTTGG - ACA JGGT - - 
ATCATCAT- - G - ATATTT- - 
ATC ATC AT - - G - ATATTT- - 
ATATC AAT AAGCG 



- -TAA.A.ATA- ATATTT A.TTT 

- - TAAAATA- ATCATTATTT 

- - TAAAATA- ATT ATT ATTT 

- - TAuAAATA - ATAATTATTT 

- -T.AAAAT A- ATAATTATTT 

- -TAAAAT TATTATTT 

- -TAAAAT TATTATTT 

- -TAAAAT TATTATTT 

- - TAAAAT TATTATTT 

- -TAAAAT TATTATTT 

- - TAAAAT TATTATTT 

- - TAAAAA TT ATT A.TTT 

- - T/\AAAA TT ATT A.TTT 

- - AAAAAT TATTATTT 

- - TAAAAA TTATTATTT 

- -TAAAAA TTATTATTT 

- -TAAAAAA- -TTATTATIT 

- - TAAAAAA - -TTATTATTT 

- - TT AAA TTT ATGATCT 

- - TT AAA TTT ATCATCT 

- - TT AAA TTT ATGATCT 

- -TTAAA TTTATCATGT 

- - TTAAA TTT ATGATCT 

- - TTAAA TTT ATCATCT 

- - TTAAA TTT ATCATCT 

- -TTCAA TTTATCATCT 

- -TTAAA TTTATCATCT 

- - TAAAAA- - TTTATCATCT 

- -TCTAA TTTATCATCT 

- -TCTAA TTTATCATCT 

- - TTAAAAA- TTTATCATCT 

ACA TTT ATCATCT 

ACA TTTATCATCT 

ACA TTT ATG A.TCT 

ACA TTTATCATCT 

- - TT ACA TTTATCATCT 

AA TTT AACATCT 

AAT - - TIT A-CATCT 

AATA TTTATCATCT 

- - TT AAATT - TTTATCATCT 

AAATA- TTTATCA TCT 

CTCTTTATCATTT 

- -TT AAATT- TTTATCATCT 

- -TCACATC - TTT ATCATCT 

CACA- -CTTT ATCATCT 

CACA- -CTTTA.TGATCT 

CACA - -CTTT ATCATCT 

- - CTACA TTTATCATCT 

- -CTACA TTTATCATCT 

T AC TTATCATCT 

TAC TTATCATCT 

TA TTTATCATCT 

TA TTTATCATCT 

TA TTTA.TCATCT 

TA TTTATCATCT 

TA TTTATCATCT 

TTCAACAAT 



V 6 TATC 

RGR-59 TATG 

92(!7 TATG 

NO TATCJ 

03 06 TATG - 

Gin TATG 

AJllUl TATG - - - 

G4 TATG 

RGR39 TATG 

L91S TATG ^ 

RGR3 8 TATG 

SLP33 TATG 

9215512 TATG " 

Kl TATG 

9215513 TATG - 

Sdos TATG 

R54 TATG 

R56 TATG 

R65 TATG 

55 GATCTCAAATCAGGTTAGGA CTACCCGCTaAATTACT 

56 GATCTCAAACCAGGTTAGGA CTACCCGCTGAATTACT 

Fl GATCTCAAATC^GGT-TGGG CTACCCGCTCAACTTAAGCC ATA 

F2 GCTCTCAAATCAGGT-AGGA CTACCCGCT 

F3 GATCTCAAATC=.GGr-AGGA CTACCCGCT 

F5 GATCTCAAATCAGTCaiJGGA CTACCCGCT 

F7 GATCTCAAATCAGGT-AGGA CT 

F6 aATCTCAAATC^GGrr-AGAT 

51 GATCTCAAATCAGGTr-AGGA CTACCCGCTCAACTTAAGCA TATCAACA 

A2 aATCTCAAATCAGGT-At5GA CTACCCGCTGAACTTAAGCA TATCAATAAGCGGA 

Al GATCTCAAATCAGGT-AGGA CTACCCGCTGAACTTAAGCA TATCAATAAGCGGA 

B GATCTCAAATCACOT-AG-- ATA--CGCCTa-.CTAA 

AGK GATCTCAAATCAGGT-AGGA CTACCCGCTGAA^CrTAAGCA TATCA.\T 

AGN GATCTCAAATCAGGT-AGGA CTACCCGCTGAACTTAAGCA TATCAATAAGCGGA 

AGP GATCTCAAATCAGGT-AGGA CTACCCGCTGAACTTAAGCA TATCAATAAGCGGAGGA 

AGBO GATCTCAAATCAGCTT-AGG- 

AGBB C;ATCTCAAATCA(OT-AGG- 

53 GATCTCAAACCAGGT-ACGA CTACCC 

Ar*J GATCTCAAATCAGGT-AGGA CTACC 

AGG GATCTCAAATCAGGT-AGG- — 

C5 GATCTCAAATCACKTr-AGGA CTACCCGCTGAACTTACCT- 

C8 GA.TCTCAAATCAGGT-AGGA CTACCCGCTGAACTTACTAG 

C2 GATCTCAAATCAGGT-AGA- -TACCG 

C4 GATCTCAAATCAGGT-AGGA CT 

C6 GATCTCAAATCAGGT-AGGA CTACCCGCTGAACTACT 

CI GATCTCAAATCnGGT-GGGA CTACCCGCTGAAC 

C^ 1 C^TCTCAAATCAGGT-CXKSA CTACCCGCTGAACTTAAAGC ATATCA-Ti^^ 

CI 2 GATCTCAAATCAGGT-GGGA CTACCCGCTGAACTTAAAGC ATATCA-TAG 

C13 GATCICAAATCAGGT-GGC^ CTACCCGCTGAACTTAAAGC ATATCAATAG 

52 GATCTCAAATCAGGT-AGGA CTACCC 

54 GATCTCAAATO.GGT-AGGA CTACXTC 

AGL GATCTCAAATCAGGT-AGGA CTACCCGCTa^LACTTAAGCA TATCAATAAGCGGAG- - 

AGI GATCTCAAATCAGGT-AGGA CTACCCG 

AGC GATCTCA.^ATCAGGT-AGG- 

ACVi GATCTCAAATCA.G 

CAG2 a=.TCTCAAATCAG 

AGD GATCTCAAATCACXTT-AGG- 

AGE GATCTCAAATCaGGT-AGG- 

AGR C^ACCTCa^ATCAGGC-GGGA CTACC CGCTa\ACTTAAGCA TATCAATAAGCGGAGGA 

AGS GACCTCGA.^TC=.GGC-GGGA CTACC CGCTGCCCTTAAGCA TATCAATAAGCGGAGGA 

THARZIANU 



end; 




FIGLTR£ 1 

Scheme representing the Internal Transcriber spaces (ITS) region of the ribosomal DNA. The 
shadowed boxes represent the ribosomal genes. The arrows denote the position of the PGR and 
sequencing primers. 
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